METHODS FOR PROTEIN ANALYSES AND PROTEIN EXPRESSION IN 
MULTICELLULAR ORGANISMS AND CELLS THEREFROM 

5 This application claims priority to U.S. provisional application Serial No, 

60/390,872 filed June 20, 2002. 

This application was supported by a grant from the National Institutes of 
Health, No. ROl GM27925-22. The government may have certain rights in this 
invention. All patents, published patent applications and other references cited in the 
10 specification are incorporated herein in their entirety. 

BACKGROUND Op THE INVENTION 
1. Field of the Invention 

15 

[001] This invention relates to methods for regulatable gene expression, 
detection and analysis of protein sequences, and detection and analysis of protein- 
protein interactions and agonists and antagonists thereof in multicellular organisms 
or cells therefrom 

20 [002] This invention relates to interactions between biological molecules, 
particularly proteins, and methods for detecting and quantifying such interactions. 
The invention is particularly related to detection of protein-protein interactions that 
occur in the cellular cytoplasm of a cell from a multicellular organism. The invention 
further provides methods for detecting nuclear export sequences and nuclear - 

25 localization sequences. This invention also relates to methods of regulatable gene 
expression. More specifically, the invention provides methods for inducible 
production of proteins in multicellular organisms or cells therefrom, including protein 
production from exogenously introduced GAL gene promoters, with or without 
galactose as the inducing molecule. 
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2. Background of the Related Art 



1 



[003] The Human Genome Project has recently determined the complete human 
genetic sequence. From studies on only a very small fraction of the genome (likely 
less than 0.1%), it appears that protein-protein interactions most often comprise key 
mechanistic features of biological processes. Protein-protein interactions thus 
5 provide potential targets for therapeutic intervention in many disease states as well 
as manipulation of gene expression for any desired purpose. 

[004] Methods for detecting and analyzing of protein-protein interactions, as 
well as methods for achieving regulatable gene expression and detection and 
analysis of protein sequences, have been the focus of work based on several 

10 different native biological processes in a simple eucaryote, the yeast 

Saccharomyces cerevisiae. Such methods, which have generally employed native 
biological processes in S. cerevisiae, thereby using the yeast cell as the cellular 
platform, are recognized in the art as being useful in the biotechnology field, due in 
part to S. cerevisiae's genetic tractability and ease of biochemical and molecular 

15 manipulation. Thus, basic discoveries related to several native biological processes 
in S. cerevisiae have been exploited for detecting and analyzing protein-protein 
interactions and agonists and antagonists thereof, as well as methods of regulatable 
gene expression and detection and analysis of protein sequences in yeast cells. 
[005] An example of a system that has been exploited for detecting and 

20 analyzing protein-protein interactions and protein sequences and methods of 

regulatable gene expression in S. cerevisiae is the Gal3p-Gal80p-Gal4p gene switch, 
also known as the Y-GAL gene switch. The Y-GAL gene switch has been used in 
many biotechnology applications (United States Patent 6,221,630 to Hopper; Mylin 
LM, et al.. Methods Enzymol. 1990; 185:297-308; Cook JO, et al.. Protein Expr 

25 Purif. 1999 Dec; 17(3): 477-84; Ferreira BS, et al., AppI Microbiol Biotechnol. 2003 
Mar; 61(1): 69-76; Panuwatsuk W, et al., Biotechnol Bioeng. 2003 Mar 20; 81(6): 
712-8; Hwang WZ, et al., J Agric Food Chem. 2001 Oct; 49(10): 4662-6; Brown R, 
et al.. Enzyme Microb Technol. 2000 Jun 1; 26 (9-10): 801-807; Yevenes A, et al.. 
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Biochimie. 2000 Feb; 82(2): 123-7; Chen S, et al., Protein Expr Purif. 1999 Dec; 
17(3): 414-20; Hansen MK, et al., Receptors Channels. 1999; 6(4): 271-81; Hinz W, 
et al., FEBS Lett. 1999 Apr 1; 448(1): 57-61; Hayden MS, et a!., Protein Expr Purif. 
1998 Mar; 12(2): 173^84; Gerik KJ, et a!., J Biol Chem. 1997 Jan 10; 272(2): 1256- 
5 62; Broder, ET, et al„ Methods in Enzymology 1999 328: 430443; Mitchell, DA, et 
a!., 1993 Yeast 9: 715-723; Romanos, MA, etal.. Yeast 8: 423488). The Y-GAL 
gene switch is connprised of three proteins: i) a site-specific DNA binding 
transcriptional activator (Gal4p) that is tightly associated with its cognate DNA site 
(the UASgal site) in the promoter regions of galactose-regulatable genes; ii) GalSOp, 

10 a protein that binds tightly to the transcriptional activation domain of Gal4p (amino 
acids 768-881) and inhibits Gal4p's capacity to recruit general transcription factors 
and RNA polymerase in the absence of galactose; and iii) Gal3p, a protein that binds 
to Gal80p in the presence of galactose and relieves Gal4p from Gal80p inhibition. 
Accordingly, Gal4p-mediated target gene expression occurs only in the presence of 

15 galactose (Johnston, M, 1987 Microbiol. Rev. 51:458476). 

[006] The Y-GAL gene switch is a widely used tool for galactose-regulatable high 
level amplified expression of heterologous proteins in yeast (Mylin LMj et al., 
Methods Enzymoi. 1990; 185:297-308; Cook JC, etal.. Protein Expr Purif. 1999 
Dec; 17(3): 477-84; Ferreira BS, et al., AppI Microbiol Biotechnol. 2003 Mar; 61(1): 

20 69-76; Panuwatsuk W, et al., Biotechnol Bioeng. 2003 Mar 20; 81(6): 712-8; Hwang 
WZ, et al., J Agric Food Chem. 2001 Oct; 49(10): 4662-6; Brown R, et al.. Enzyme 
Microb Technol. 2000 Jun 1; 26 (9-10): 801-807; Yevenes A, et al., Biochimie. 2000 
Feb; 82(2): 123-7; Chen S, et al., Protein Expr Purif. 1999 Dec; 17(3): 414-20; 
Hansen MK, et al., Receptors Channels. 1999; 6(4): 271^1; Hinz W, et al., FEBS 

25 Lett. 1999 Apr 1; 448(1): 57^61; Hayden MS, et al., Protein Expr Purif. 1998 Mar; 
12(2): 173-84; Gerik KJ, et al., J Biol Chem. 1997 Jan 10; 272(2): 1256-62; Broder, 
ET, et al.. Methods in Enzymology 1999 328: 430443; Mitchell, DA, et al., 1993 
Yeast 9: 715-723; Romanos, MA, et al., Yeast 8: 423488). More recently, some of 
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the instant inventors developed methodologies capable of detecting and 
characterizing protein-protein interactions, nuclear export and nuclear import 
sequences and galactose-independent inducibility of Gal4p-mediated gene 
expression in yeast by exploiting particular principles of the Y-GAL gene switch (see 

5 co-owned and co-pending U.S. Patent Application No. 10/165,873, incorporated 
herein in its entirety). 

[007] The protein-protein interaction method of Y- GAL gene switch 
methodologies is referred to as the "80-Trap method" and differs significantly from 
classical two-hybrid methods that depend on an intra-nuclear location of both 

10 interacting proteins. Specifically, in the 80-Trap method, the protein-protein 

interaction occurs within the cytoplasm but is reported (detected) by gene activation 
in the nucleus. Other methods provided in the Y-GAL gene switch methodologies are 
useful for detecting and analyzing nuclear export and nuclear import sequences. 
Additional methods provide a galactose-independent method of inducing Gal4p- 

15 mediated target gene expression. This last method provides for Gal4p-mediated 
activation of desired target genes by use of any low molecular weight, cell-permeant 
molecule that is a known effector of protein-protein interaction. 

[008] The methods provided, using the Y-GAL gene switch, exploit the fact that 
Gal4p's activation of target genes Is regulated by GalSOp, a protein that binds to the 

20 Gal4p transcriptional activation domain and prevents it from recruiting general 

transcription factors required for recruiting RNA polymerase to the Gal4p-regulatable 
target promoter. These methods additionally take advantage of either one or more 
of three heretofore unappreciated mechanistic features of the Y-GAL gene switch. 
These newly-recognized mechanistic features are that i) Gal3p need not enter tiie 

25 nucleus to form a complex with the Gal80 protein in the presence of galactose; ii) 
when GalSOp, a nuclear/cytoplasmic shuttling protein, shuttles out of the nucleus it 
binds to cytoplasmically-restricted GaISp; and, iii) binding of GalSOp to GaISp in the 
cytoplasm reduces the amount of GalBOp in the nucleus available to bind to UASgal- 
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associated Gal4p at the gene promoter and, as a consequence, Gal4p-mediated 
transcriptional activation of Gal4p-regulated genes is increased. Tlius, galactose- 
triggered interaction of GalSOp and Gal3p in the cytoplasm traps GalSOp in the 
cytoplasm and reduces the concentration of GalSOp in the nucleus. Consequently, 
5 the amount of GalSOp bound to Gal4p decreases, which results in Gal4p-mediated 
gene activation. 

[009] Unlike in yeast, there are very few existing methods for detecting and 
analyzing protein-protein interactions and agonists and antagonists thereof or 
regulatable gene expression in multicellular organisms or cells therefrom. Moreover, 

10 existing methods suffer from serious limitations. For example, one method to 
activate promoters in multicellular organisms or cells therefrom utilizes the Gal4 
protein, the UAScAL-specific transcriptional activator of yeast referred to above. 
However, Gal4p expression in multicellular organisms or cells therefrom causes 
constitutive activation of target promoters containing the UASgal site (Sadowski, I, 

15 Genetic Engineering, 1995 17:119-147; Horikoshi M, et al.. Cell. 19S8 Aug 26; 
54(5): 665-9; Webster N, et al.. Cell. 1988 Jan 29; 52(2): 169-78; Brand AH, et al.. 
Development. 1993 Jun; 1 18(2): 401-15; Scott K, et al., Cell. 2001 Mar 9; 104(5): 
661-73; Hrdlicka L, et al.. Genesis. 2002 Sep-Oct; 34(1-2): 51-7). A method to 
regulate Gal4p-mediated target gene expression in mammalian calls has been 

20 developed that takes advantage of the FLP recombinase target sequence (FRT) and 
recombination by the FLP recombinase. This method involves engineering into the 
same cells an FRT-associated GAL80 gene that can be excised by the FLP 
recombinase when the expression of the recombinase gene is induced by the heat 
shock promoter (Lee T, et al.. Trends Neurosci. 2001 May; 24(5): 2514). This 

25 system has numerous significant problems, among them the heat shock method of 
induction and the extended persistence of the GalSO protein, which can remain in the 
cells for as long as 48 hours after recombination excision of the GALSO gene 
sequences. 



5 



[010] Another existing nnethod used in mammalian cells is the Gal4p-based two- 
hybrid system for analysis of protein-protein interactions (Sadowski I, Genetic 
Engineering, 1995 17: 119-147; Sadowski I, Anal Biochem. 1998 Feb 15; 256(2): 
245-7; Feng, Xin-Hua, et al., Two-Hybrid Systems 2001 177: 222-239. Humana 
5 Press, Edited by Paul N. MacDonald; Shioda T, et al., Proc Natl Acad Sci USA. 
2000 May 9; 97(10): 5220-4; Rojo-Nlersbach E, et a!., Biochem J. 2000 Jun 15; 
348 Pt 3: 585-90; Buchert M, et al., Biotechniques. 1997 Sep; 23(3): 396-8, 400, 
402; Luo Y, et aL, Biotechniques. 1997 Feb; 22(2): 350-2). One serious limitation of 
two-hybrid methods in multicellular organisms or cells therefrom Is they are not 

10 applicable for a very large number of proteins including, but not limited to, most 
transcriptional activators, transcriptional repressors, RNA polymerase II components, 
components of the general (basal) transcription machinery, and proteins that are 
associated with chromatin or participate in chromatin remodeling. This limitation 
stems from the existing two-hybrid methods' detection of protein-protein interaction 

15 by activating reporter genes through either direct binding to the RNA polymerase or 
through binding to other proteins that in turn bind to an RNA polymerase subunit. 
Another serious limitation of existing methods is that they require that the relevant 
protein-protein interactions and the detection thereof take place in the nucleus at the 
reporter gene promoter. Thus, a protein that normally does not enter the nucleus is 

20 unlikely to participate in its normal protein interaction(s) if targeted to and 
sequestered In the nucleus. For example, post-translatlonal modification (e.g., 
phosphorylation) of cytoplasmic proteins is often mediated by interaction with one or 
more cytoplasmically-confined proteins; this process is illustrated in many well- 
established membrane-based receptor signaling cascades wherein proteins are often 

25 tethered to membranes or In close proximity to membrane-associated proteins. 
Thus, direct targeting to the nucleus of a protein that normally would be post- 
translationally modified in the cytoplasm would prevent or severely hinder such 
modifications required for normal biological activity. Therefore, existing methods in 
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the art requiring nuclear localization of both interacting proteins are not useful for 
detecting a significant fraction of protein-protein interactions that normally occur 
within the cell. 

[01 1] Given the limitations of existing analytical systems in multicellular 
5 organisms and cells therefrom, there is a need in the art for methods that provide a 
capacity to detect and analyze protein-protein interactions and agonists and 
antagonists thereof, detect and analyze protein sequences, and for regulatable gene 
expression in multicellular organisms or cells therefrom, including non-mammalian 
cells such as, for example, plant cells, particularly wherein such methods detect 

10 protein-protein interactions in the cytoplasm. Considering the highly regulated nature 
of most biologically-interesting genes studied to date and the fact that the estimated 
number of protein-coding sequences in the human genome will turn out to be only 
25,000 to 30,000, it is important not to exclude a large fraction of genes from 
protein-protein interaction analyses, as occurs using the methods known in the art. 

15 In view of the large number of genetic sequences being determined, there is a need 
in the art for methods of identifying and characterizing the properties of the protein 
products encoded thereby in multicellular organisms (and cells thereof), including 
vertebrates, invertebrates and plants. 

20 SUMMARY OF THE INVENTION 

[012] The invention provides methods for detecting and analyzing protein- 
protein interactions and agonists and antagonists thereof, detecting and analyzing 
protein sequences, and producing regulatable gene expression, as well as reagents 
25 for performing said methods. 

[013] In a first aspect, the invention provides a method of detecting protein- 
protein interactions in a host cell cytoplasm. The method provides a first fusion 
protein between a first protein or protein-binding fragment thereof and a transcription 
inhibitor that can partition or shuttle between the cell nucleus or cytoplasm, and a 
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second fusion protein between a second protein or protein-binding fragment thereof 
and a cytoplasmic membrane localization factor, wherein transcription of a gene 
operably linked to a promoter that is sensitive to or regulated by said transcription 
inhibitor is detected when the first fusion protein binds to the second fusion protein. 
5 In certain embodiments, the first protein or protein-binding fragment, the second 
protein or protein-binding fragment, or both the first and second protein or protein- 
binding fragment is encoded by a cDNA, for example, coding for a known protein or 
protein binding fragment, or a member of a cDNA library, i.e., an unknown protein or 
protein binding fragment. In a preferred embodiment, the cytoplasmic membrane 

10 localization factor is a cell membrane myristoylation signal or a mitochondrial 

membrane localization signal. In certain embodiments, the myristoylation sequence 
can be MGCTVSTQTIGDESDP (SEQ ID N0:1) or multicellular species-specific 
homolog thereof. Similarly, in certain embodiments, the mitochondrial outer 
membrane targeting sequence can be the N-terminal sequence of Tom70/Mas70 

15 protein, MKSFITRNKTAILATVAATGTAIGAYYYY (SEQ ID N0:3), or multicellular species- 
specific homolog thereof. 

[014] The invention also provides a first recombinant expression construct 
encoding the first fusion protein and a second recombinant expression construct 
encoding the second fusion protein, as well as cells, preferably cells from a 

20 multicellular organism, and most preferably human cells, comprising the first 

recombinant expression construct, the second recombinant expression construct or 
both said first and second recombinant expression constructs. In certain 
embodiments, said cells from a multicellular organism also comprise a gene 
operably linked to {inter alia, transcribed from) a promoter, wherein transcription 

25 from the promoter is sensitive to or regulated by inhibition by the transcription 

inhibitor comprising the first fusion protein. The product of the gene operably linked 
to the promoter that is sensitive to or regulated by inhibition by the transcription 
inhibitor is optionally a detectable product. Similarly, the gene operably linked to a 



promoter that is sensitive to or regulated by said transcriptional inhibitor can be a 
selectable gene, wherein increased expression of said gene confers a growth 
advantage on the cell or distinguishes the cell in some detectable manner. When the 
operably-linked gene is a selectable gene, the methods of this embodiment can 
5 further comprise placing the host cell under selective growth conditions and 
detecting increased growth or survival of said cells under said selective growth 
conditions, wherein said protein-protein interaction is detected thereby. 

[015] In other related embodiments, the first protein or protein-binding 
fragment, second protein or protein-binding fragment, or both first and second 

10 protein or protein-binding fragment can be a transcriptional activator protein or one 
of a multiplicity of proteins that participate in protein-protein interactions to bring 
about transcriptional activation. The first protein or protein-binding fragment, second 
protein or protein-binding fragment, or both first and second protein or protein- 
binding fragment can also be detectable or produce detectable metabolites. In other 

15 embodiments, the gene operably linked to a promoter that is sensitive to or 
regulated by said transcriptional inhibitor can be a gene expressed from a GAL4 
protein activatable promoter, optionally one of a multiplicity of promoters that 
contain a UASgal site, and the gene expressed from the GAL4 protein activatable 
promoter can be one of a multiplicity of genes that encode detectable proteins. In 

20 additional related embodiments, the transcription inhibitor can be GalSOp. 

[016] In a second aspect, the invention provides methods for isolating the first 
or second fusion proteins of the above embodiments, the method comprising the 
steps of detecting increased expression of the gene operably linked to a promoter 
that is sensitive to or regulated by said transcriptional inhibitor; and isolating said 

25 first or second fusion proteins. As above, the first fusion protein, second fusion 
protein, or both the first and second fusion proteins can be encoded by one or a 
plurality of members of a cDNA library. 
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[017] In a third aspect, the invention provides methods for detecting protein- 
protein interactions in the cytoplasm of a cell of a multicellular organism, the 
methods comprising the steps of introducing into a cell from a multicellular organism 
a first recombinant expression construct encoding a first protein or protein-binding 

5 fragment thereof fused with the amino- or carboxyl- terminus of GalSOp; and 
introducing into the cell from a multicellular organism a second recombinant 
expression construct encoding a second protein or protein-binding fragment thereof 
fused to a cytoplasm localization sequence, wherein upon interaction of the first and 
second proteins in the cell cytoplasm, said GalSOp is localized to the cytoplasm, 

10 wherein transcription of a gene operably linked to a promoter that is sensitive to or 
regulated by said GalSOp is increased; and detecting said increased transcription of 
said gene, wherein said protein-protein interaction is detected thereby. As in other 
aspects of the invention, the first and/or second protein or protein-binding fragment, 
in this aspect of the invention encoded by the first and second recombinant 

15 expression constructs, can be encoded by a cDNA, for example, coding for a known 
protein or protein-binding fragment, or a member of a cDNA library, i.e., an unknown 
protein or protein-binding fragment. 

[018] In the above aspects of the invention, as well as any aspects herein 
related to protein-protein interactions, when the first or second protein or protein- 

20 binding fragment thereof is a known protein, it is termed the "bait." The bait can be 
fused to either the transcriptional inhibitor, in preferred embodiments GalSO protein, 
or targeted to a membrane, for example, the cell plasma and vesicular membranes 
or the mitochondrial outer membrane, by fusion with multicellular organism-based or 
synthetic cell membrane targeting and anchoring sequences. The protein or protein- 

25 binding fragment being tested for protein-protein interactions with the known bait 
protein is termed the "prey." When the known bait is fused to the transcriptional 
inhibitor, e.g., GalSOp, the prey is targeted to one of the above-mentioned 
membranes. Similarly, the known bait can be targeted to one of the above- 



mentioned membranes, wherein the prey is fused to transcriptional inhibitor. 
Further, unknown proteins or protein-binding fragments can be used for fusion to the 
transcriptional inhibitor and for targeting to the above-mentioned membranes. In 
such embodiments, both fusions are technically prey; however, the present invention 
5 nevertheless enables detection of interactions between two such prey fusions. 
Interaction between the two fusion proteins traps the transcriptional inhibitor in the 
cytoplasm, which reduces the amount of transcriptional inhibitor available to 
suppress transcription of a coding sequence operably-connected to a promoter 
sensitive to or regulated by the transcriptional inhibitor. For example, interaction 

10 between a GalSOp fusion protein and membrane localization sequence-containing 
fusion protein traps the GalSOp in the cytoplasm. Consequently, less GalSOp is 
available to bind to any nuclear Gal4p transcription activation domains introduced or 
otherwise present in the cell from a multicellular organism, which, in turn, activates 
transcription from any GAL promoters in the nucleus, inducing the expression of GAL 

15 reporter genes. 

[019] In this third aspect, the invention also provides a first recombinant 
expression construct encoding the first fusion protein comprising the first protein or 
protein-binding fragment (known, i.e., bait, or unknown, i.e., prey, as described 
above) thereof fused to the GalSO protein, and a second recombinant expression 

20 construct encoding the second fusion protein comprising the second protein or 
protein-binding fragment (known, i.e., bait, or unknown, i.e., prey, as described 
above) thereof targeted to either the cell plasma and vesicular membranes or the 
mitochondrial outer membrane by fusion with multicellular organisnrhbased or 
synthetic cell membrane targeting and anchoring sequences or mitochondrial outer 

25 membrane targeting and anchoring sequences, respectively. In this aspect are also 
provided cells, most preferably cells from a multicellular organism, comprising the 
first recombinant expression construct, the second recombinant expression 
construct or both said first and second recombinant expression constructs. 



[020] Thus, the invention broadly provides methods for detecting proteiniDroteiri 
interactions through the introduction of a first and second recombinant expression ' 
constructs into a cell, most preferably a cell from a multicellular organism, and 
expression of the first and second fusion proteins therein. Expression of a second 
5 fusion protein that does not bind to the first fusion protein results in no increase in a 
gene operably linked to a promoter sensitive to or regulated by the transcription 
inhibitor comprising said first fusion protein. Expression of a second fusion protein 
that does bind to the first fusion protein results in an increase in expression of a 
gene, optionally a reporter gene whose gene product is detectable, operably linked 

10 to a promoter sensitive to or regulated by the transcription inhibitor comprising said 
first fusion protein, because said first fusion protein partitions preferably to the 
cytoplasm and thereby releases transcription inhibition of said gene. In certain 
embodiments, said first fusion protein comprises a particular protein species, i.e., a 
known protein or protein-binding fragment. In alternative embodiments, said first 

15 fusion protein comprises a plurality of members of a cDNA library of species, i.e., an 
unknown protein or protein-binding fragment, wherein the first fusion protein 
comprises a plurality of fusion proteins in which the transcription inhibitor protein 
(such as GalSOp) is fused to each of a plurality of members of said cDNA library in 
each species of fusion protein comprising said plurality. In a preferred embodiment, 

20 said cell is a cell from a multicellular organism. 

[021] This method permits screening for and characterization of protein 
interactions in ways not possible using existing conventional two-hybrid approaches. 
The method is advantageous for identifying protein interactions wherein one or both 
proteins is a transcriptional activator, transcriptional repressor, RNA polymerase II 

25 component, a component of the general (basal) transcription machinery, or one or 
more of the very large number of proteins identified as being associated with 
chromatin or involved in chromatin remodeling, including transcriptional repressors, 
and proteins requiring cytoplasmic modification for interaction, as well as any other 



protein species that can form a specific binding pair witli another protein species in 
the cytoplasna of the cell. This method is also advantageous for identifying protein 
interactions where one or both proteins require residence in the cytoplasm for native 
activity, the capacity of the inventive methods for targeting prey to different cellular 
5 environments (for example, plasma and vesicular membrane, outer mitochondrial 
membrane, and outer nuclear membrane and endoplasmic reticular membranes) 
within the cell from a multicellular organism is particularly advantageous in assays 
for certain protein interactions, such as for any proteins that have portions 
protruding into the cytoplasm. Additionally, semi-quantitative results comparing 

10 interactions between different protein pairs may be performed using the methods of 
this invention. The method also takes advantage of a wide range of existing two- 
hybrid reporters known in the art; moreover, the method can be performed in 
reverse two-hybrid or selection mode when, for example, the gene whose 
transcription is stimulated upon binding of the protein or protein-binding fragments of 

15 interest encodes a cytostatic or cytotoxic gene product. Interaction of the proteins 
or protein-binding fragments is identifiable through the death or growth cessation of 
the cells from a multicellular organism. 

[022] The invention also provides methods and reagents for detecting the 
presence of nuclear export sequences or nuclear import/localization sequences. In 

20 a fourth aspect, the invention provides a method for detecting in a host cell 

cytoplasm a nuclear export sequence (NES). The method comprises the steps of 
introducing into the cell a recombinant expression construct encoding a protein or 
protein-binding fragment containing putative NES fused with the amino- or carboxyl- 
terminus of a transcription inhibitor; assaying the cell for expression of a gene 

25 operably linked to a promoter that is sensitive to or regulated by said transcriptional 
inhibitor; and detecting increased expression of said gene, optionally a detectable 
product, wherein a NES is detected thereby. Similar to previous aspects of the 
invention, the recombinant expression consti'uct can comprise a cDNA or a member 
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of a cDNA library. In related embodiments, the gene that is operably linked to a 
promoter that is sensitive to or regulated by said transcriptional inhibitor can be a 
selectable gene, wherein increased expression of said gene confers a growth 
advantage or disadvantage on the cell or detectably distinguishes the cell. In such 

5 embodiments, the method for detecting in a host cell cytoplasm a nuclear export 
sequence (NES) can optionally comprise subjecting the host cell to selective growth 
conditions; and detecting increased or decreased growth or survival of said cells 
under selective growth conditions; wherein said NES is detected thereby. 

[023] In a fifth aspect, the invention provides a method for isolating a nuclear 

10 export sequence (NES) comprising detecting increased expression of a gene 
operably linked to a promoter that is sensitive to or regulated by GalSOp in a cell 
comprising a recombinant expression construct encoding a protein or protein-binding 
fragment containing a putative NES fused with the amino- or carboxyl-terminus of 
GalSOp; and isolating said NES from the protein fused to GalSOp. The gene operably 

15 linked to a promoter that is sensitive to or regulated by GalSOp can be a gene 
expressed from a GAL4 protein activatable gene promoter and can optionally 
encode a detectable protein. Likewise, the gene operably linked to a promoter that 
is sensitive to or regulated by GalSOp can be a selectable gene, wherein increased 
expression of said gene confers a growth advantage or disadvantage on the cell or 

20 distinguishes the cell in some detectable manner. In some embodiments, the GAL4 
activatable promoter can be one of a multiplicity of promoters that contain a UASqal 
site. The NES can comprises one or a plurality of members of a cDNA library, 
wherein the first fusion protein comprises a plurality of fusion proteins in which the 
transcription inhibitor protein (such as GalSOp) is fused to each of a plurality of 

25 members of said cDNA library in each species of fusion protein comprising said 
plurality, wherein certain of the members of the cDNA species comprise an NES. 

[024] When the gene operably linked to a promoter that is sensitive to or 
regulated by GalSOp is a selectable gene that confers a growth advantage or 



disadvantage on the cell, the methods of this aspect can further comprise subjecting 
the host cell to selective growth conditions, and detecting increased or decreased 
growth or survival of said cells under selective growth conditions, wherein said NES 
is detected thereby. 

5 [025] In another related embodiment, the invention provides a more general 
method for isolating a nuclear export sequence (NES). The method comprises the 
steps of detecting increased expression of a gene operably linked to a promoter that 
is sensitive to or regulated by transcriptional inhibition in a cell comprising a 
recombinant expression construct encoding a protein or protein-binding fragment 

10 containing a putative NES fused with the amino- or carboxyl-terminus of a 

transcription inhibitor that inhibits said promoter that is sensitive to or regulated by 
transcriptional inhibition; and isolating said recombinant expression construct 
comprising said NES from the cell. 

[026] In a sixth aspect, the invention provides' a method for detecting nuclear 

15 export sequences (NES) in a cytoplasm of a cell from a multicellular organism. The 
method comprises the steps of introducing a gene expressing a protein or fragment 
thereof fused with the amino- or carboxyl-terminus of GalSOp; assaying the cell for 
expression of a gene operably linked to a promoter that is sensitive to or regulated 
by GalSOp; and detecting increased expression of said gene, wherein an NES is 

20 detected thereby. In certain embodiments, the recombinant expression construct 
comprises a cDNA or a member of a cDNA library, wherein the first fusion protein 
comprises a plurality of fusion proteins in which GalSOp is fused to each of a plurality 
of members of said cDNA library in each species of fusion protein comprising said 
plurality, wherein certain of the members of the cDNA species comprise an NES. 

25 [027] In a seventh aspect, the invention provides a method for detecting a 

nuclear localization sequence (NLS) in a host cell comprising the steps of introducing 
into the cell a recombinant expression construct encoding a protein or protein- 
binding fragment containing a putative NLS fused with the amino- or carboxyl- 
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terminus of a transcription inhibitor; assaying the cell for expression of a gene 
operably linked to a promoter that is sensitive to or regulated by said transcriptional 
inhibitor; and detecting decreased expression of said gene, wherein an NLS is 
detected thereby. In certain embodiments of this aspect, the gene operably linked 
5 to a promoter that is sensitive to or regulated by said transcriptional inhibitor can 
encode a product that converts a non-toxic compound to a cytotoxic or cytostatic 
compound. The recombinant expression constructs of this aspect of the invention 
can comprise a cDNA or a member of a cDNA library, wherein the first fusion protein 
comprises a plurality of fusion proteins in which the transcription inhibitor protein 

10 (such as GalSOp) is fused to each of a plurality of members of said cDNA library in 
each species of fusion protein comprising said plurality, wherein certain of the 
members of the cDNA species comprise an NLS, 

[028] In the aspects of the invention that relate to NLS detection, any NLSs that 
are native to the transcriptional inhibitor can be deleted or mutated (i.e., rendered 

15 non-functional or of limited function) prior to fusion to putative NLS-containing 

proteins or protein-binding fragments. In such embodiments, basal-level expression 
of the gene operably linked to the promoter that is sensitive to or regulated by said 
transcriptional inhibitor can be greatly increased, thereby enhancing the signal of any 
non-native NLS introduced in the methods herein. Such an embodiment is 

20 particularly useful when the product of the gene operably linked to the promoter that 
is sensitive to or regulated by said transcriptional inhibitor encodes a product that 
can convert a non-toxic compound to a cytotoxic or cytostatic compound, or simply 
encodes a toxic or cytostatic product itself. The present of a NLS decreases the 
expression of a product that is cytotoxic or cytostatic, thus conferring a growth 

25 advantage on those cells that have had a functional NLS fused to the protein or 
protein-binding fragment. 

[029] In other embodiments of this aspect of the invention, expression of the 
gene operably linked to a promoter that is sensitive to or regulated by said 



transcriptional inhibitor can be inducible when the cell is contacted with an inducing 
agent. In certain embodiments, the transcriptional inhibitor is GalSOp and the 
inducing agent can be galactose. 

[030] In still other embodiments of this aspect of the invention, the gene 
5 operably linked to a promoter that is sensitive to or regulated by the transcriptional 
inhibitor to which a NLS is fused encodes a product that converts a non-toxic 
compound to a cytotoxic or cytostatic compound and wherein expression of the 
gene operably linked to a promoter that is sensitive to or regulated by said 
transcriptional inhibitor is inducible when the cell is contacted with an inducing agent, 

10 wherein decreased expression of the gene is detected by cell growth or survival 
when the cell is contacted with both the inducing agent and the cytotoxic or 
cytostatic compound. 

[031] In an eighth aspect, the invention provides a method for isolating a 
nuclear localization sequence (NLS), The method comprises the steps of detecting 

15 decreased expression of the gene operably linked to a promoter that is sensitive to 
or regulated by a transcriptional inhibitor and encoded by a recombinant expression 
construct encoding a NLS-containing protein or NLS-containing fragment thereof 
fused with the amino- or carboxyl-terminus of the transcription inhibitor; and isolating 
said recombinant expression construct comprising said NLS from the cell. 

20 [032] In a related ninth aspect, the invention provides a method for detecting a 
nuclear localization sequence (NLS) in a host cell comprising the steps of introducing 
into the cell a recombinant expression construct encoding a NLS-containing protein 
or NLS-containing fragment thereof fused with the amino- or carboxyl-terminus of 
GalSOp; assaying the cell for expression of a gene operably linked to a promoter that 

25 is sensitive to or regulated by GalSOp; and detecting decreased expression of said 
gene, wherein an NLS is detected thereby. In certain embodiments, one or more of 
the native GalSOp NLSs are deleted or mutated {i.e., rendered non-functional) prior to 
fusion to putative NL$-containing protein or protein-binding fragments. Further, in a 



native-NLS-free background, sequences that act as NLSs or NESs can increase the 
nuclearity (in the case of a sequence acting as a NLS) or cytoplasmicity (in the case 
of a sequence acting as a IMES). As above, a NES results in increased expression of 
a gene operably linked to the promoter that is sensitive to or regulated by GalSOp, 

5 and a NLS results in decreased expression of the gene operably linked to the 
promoter that is sensitive to or regulated by GalSOp. 

[033] In a tenth aspect, the invention provides a galactose-independent method 
of activating GAL4 protein regulatable gene expression in multicellular organisms or 
cells therefrom. The method comprises the steps of introducing into a cell from a 

10 multicellular organism a gene expressing a first protein or fragment thereof fused 
with the amino- or carboxykerminus of GalSOp; introducing into said cell from a 
multicellular organism a gene expressing a second protein or fragment thereof fused 
to a membrane targeting sequence; and treating the cell from a multicellular 
organism with an inducing agent, wherein treatment mediates the cytoplasmic 

15 interaction of the first and second proteins, relieving GalSOp inhibition and permitting 
Gal4p activation of GAL4 protein regulatable promoters, wherein the promoters are 
fused to a heterologous gene. The Gal4p or a fragment thereof can be expressed 
from a chromosomal Gal4p gene or a heterologous Gal4p gene introduced into the 
multicellular organisms or cells therefrom. The membrane targeting sequences can 

20 be a multicellular organism-based or synthetic myristoylation sequence, 
mitochondrial outer membrane targeting sequence, or other membrane 
targeting/anchoring sequences. In certain embodiments, the myristoylation 
sequence can be MGCTVSTQTIGDESDP (SEQ ID N0:1) or multicellular species- 
specific homolog thereof. Similarly, in certain embodiments, the mitochondrial outer 

25 membrane targeting sequence can be the N-terminal sequence of Tom70/l\/las70 
protein, MKSFITRNKTAILATVAATGTAIGAYYYY (SEQ ID N0:3) or multicellular species- 
specific homolog thereof. 



[034] In certain embodiments of this aspect of the invention, the first and/or 
second proteins can be any protein encoded by a cDNA. In certain embodiments, 
the first protein can be Fpri and the second protein can be Cnal. The induction agent 
can be any molecule that mediates the interaction of said first and second proteins, 
5 and in certain embodiments the induction agent can be FK506. 

[035] The GAL4 protein-regulatable genes can be expressed from UASqal 
containing promoters. More specifically, the GAL4 protein regulatable gene can be 
transcribed through any promoter under the control of UASqal sequences and the 
UAScAL-specific transcriptional activator sequences. 

10 [036] In an eleventh aspect, the invention provides methods for regulatable 
gene expression in multicellular organisms or cells therefrom comprising the steps 
of introducing a recombinant expression construct encoding GalSOp into a 
multicellular organism or cell therefrom, introducing a recombinant expression 
construct encoding Gal3p into the multicellular organism or cell therefrom, 

15 introducing a recombinant expression construct encoding Gal4p into the multicellular 
organism or cell therefrom, introducing a coding sequence operably linked to a 
promoter sensitive to or regulated by Gal3p/Gal4p/Gal80p, and treating the 
multicellular organism or cell therefrom with an inducing agent, wherein treatment 
mediates the cytoplasmic interaction of the Gal3p and GalSOp, relieving GalSOp 

20 inhibition and permitting Gal4p activation of the coding sequence operably linked to 
the promoter sensitive to or regulated by the GalSOp, Gal3p and Gal4p, wherein the 
promoters are fused to a heterologous gene. In certain embodiments, the inducing 
agent is galactose. In other embodiments, Gal4p or a fragment thereof is expressed 
from a heterologous Gal4p gene introduced into the multicellular organisms or cells 

25 therefrom. In still other embodiments, the coding sequence is operably linked to a 
promoter sensitive to or regulated by Gal3p, Gal4p and GalSOp are expressed from 
UASGAL-containing promoters. 



19 



[037] In a twelfth aspect, the invention provides methods for regulatable gene 
expression in multicellular organisms or cells therefrom comprising the steps of 
introducing a recombinant expression construct encoding GalSOp into the cell from a 
multicellular organism, introducing a recombinant expression construct encoding 

5 Gal3p into said cell from a multicellular organism, introducing a recombinant 

expression construct encoding Gal4p or a fragment thereof fused to a transcription 
factor or DNA-binding portion thereof into said cell of a multicellular organism, 
wherein the transcription factor or DNA binding fragment thereof is a transcription 
factor from a multicellular organism, introducing a coding sequence operably linked 

10 to a promoter sensitive to or regulated by the transcription factor, and treating the 
cell from a multicellular organism with an inducing agent, wherein treatment 
mediates the cytoplasmic interaction of the GaISp and GalSOp, relieving GalSOp 
inhibition of the Gal4p-transcription inhibitor fusion protein and permitting activation 
of the coding sequence operably linked to the promoter sensitive to or regulated by 

15 the transcription factor. In certain embodiments, the coding sequence operably 
linked to the promoter sensitive to or regulated by the transcription factor is an 
endogenous gene. In other embodiments, the promoter sensitive to or regulated by 
the transcription factor is an endogenous promoters. In still other embodiments, the 
promoters operably linked to a heterologous gene. In yet further embodiments, the 

20 promoter is linked to a reporter gene. In other embodiments, the inducing agent is 
galactose. 

[038] Specific preferred embodiments of the present invention will become 
evident from the following more detailed description of certain preferred 
embodiments and the claims. 

25 

BRIEF DESCRIPTION OF tHE DRAWINGS 

[039] Figures lA through IC. (Yeast cells) Sequestration of Gal3p to cellular 
membranes does not affect its induction function. The results of assays performed 
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as described in Example 1 establishes that Gal3p need not enter the yeast nucleus 
to mediate activation of Gal4p, and in turn, cause activation of Gal4p-activatable GAL 
gene promoters. Figure lA illustrates localization of Gal3p fused to green 
fluorescence protein (GFP) as visualized by fluorescence microscopy. Cells of yeast 
5 strain Sc781 (gal3A gallA) carrying the indicated low-copy expression plasmid were 
grown to mid-log phase in medium containing 2% glycerol and 3% lactic acid as 
carbon source and induced with 2% galactose for 3 hours before microscopic 
observation. Green fluorescent protein (GFP) is inserted at the C termini of each of 
wild-type Gal3p, myristoylated Gal3p (Myr-Gal3p) and mMyr-Gal3p. The vacuolar 

10 (yeast lysosomal) membrane is readily seen in cells expressing Myr-Gal3GFP. For 
mitochondrial-associated Gal3p, the GFP sequence is positioned between the signal 
anchor sequence and the Gal3p sequence. Yeast mitochondria form the typical 
reticulum structure under the experiment conditions used. Figure IB shows MELl 
induction in cells carrying Gal3p derivatives. The results were obtained after 6 hours 

15 of galactose induction and were normalized to results obtained from cells carrying 
wild type Gal3p. The a-galactosidase induction assay was carried out essentially as 
described (Peng and Hopper, 2000, Mo/. Cell. Biol. 20: 5140-5148). Figure IC 
shows membrane-associated Gal3p that sequesters GalBOp to membranes. Cells of 
yeast Sc786 strain igalSOA gal3A gallA) carrying a low-copy plasmid expreissing 

20 Gal80-GFP together with wild type Gal3p, or Myr-Gal3p, or mMyr-Gal3p, as indicated, 
were observed as in Figure lA. The images in Figure IC illustrate that Gal80-GFP 
molecules interact with membrane-associated Gal3p in galactose. 

[040] Figures 2A through 2C. (Yeast cells) Lowering Gal3p expression levels 
causes reduction in GAL gene expression. Figure 2A shows sequence differences 

25 between different Gal3p encoding plasmids. pGP74 encodes a wild-type Gal3p 
expressed from an ADH2 promoter. pGP144 encodes myristoylated Gal3p (Myr- 
Gal3p) expressed from an identical M)H2 promoter. pGP147 encodes a wild-type 
Gal3p expressed from a modified ADH2 promoter containing a SObp sequence 



insertion between the promoter sequence and the start codon of Gal3p. Numbers in 
the promoter region correspond to the distance from the start codon in the ADH2 
gene Figure 2B illustrates results showing that different Gal3p expression levels 
were conferred by different Gal3p-encoding plasmids. Protein extracts from cells 
5 grown in the absence of galactose and from cells grown for 6 hours in the presence 
of galactose were analyzed by Western blot. Figure 2C shows MELl induction in 
cells carrying plasmids pGP74, pGP144 or pGP147, Cells carrying each of the 
respective plasmids were induced with 0.5% galactose for 6 hours and cellular 
extracts prepared. Aliquots of the cellular extracts were subjected to Western blot 

10 analyses as shown in Figure 2B to provide estimates for GaISp expression levels. 
The a-galactosidase induction assay was carried out essentially as described (Peng 
and Hopper, 2000, Mol. Ceil. Biol. 20: 5140-8). Both GaISp expression levels and 
MELl induction results were normalized to those of cells carrying pGP74. 
[041] Figure 3A through 3C. (Yeast cells) Analysis of the association of 

15 Gal4p, GalSOp and GaISp with UASgal sequences using chromatin 

immunoprecipitation. These data establish that galactose triggers a reduction of the 
amount of GalSOp bound to Gal4p at the UASgal site within GAL gene promoters. 
When considered with the illustration in Fig. IC, these data establish that interaction 
of GaISp and GalSOp in the cytoplasm leads to a reduction of GalSOp bound to Gal4p 

20 in the nucleus within the GAL gene promoters. Yeast cells were grown to early-log 
phase in medium containing 2% glycerol/3% lactic acid/O.05% glucose (-galactose, 
un-induced) and induced with 2% galactose for 20 minutes (-i-galactose, induced). 
The 344 bp PGR product (UASqal) corresponds to the region extending from -600 to 
-257 relative to the -hi of the GALl gene. The 2S7 bp PGR product (intergenic 

25 control) corresponds to an intergenic region located 5 Kb downstream from the 
GALl promoter and served as a control for background binding in the 
immunoprecipitation. The ratio between the 344 bp and 287 bp PGR products (fold 
enrichment) is calculated to normalize results and assess promoter occupancy. 
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Figure 3A shows Gal4p and GalSOp occupancy at UASgal sites in un-induced and 
induced cells. The results from two-fold dilution of the template DNA used for the 
PGR reactions demonstrated that the reaction was within the linear range. Fold 
enrichments are indicated (enrichment). Figure 3B shows GalSGp and Gal3p 
5 association with the UASgal region evaluated before and after galactose induction. 
Figure 3C illustrates quantitation of promoter occupancy by Gal4p, GalSOp and 
Gal3p in un-induced and induced cells. Values are means (± standard deviation) of at 
least five experiments carried out with two sets of independently prepared chromatin 
samples. 

10 [042] Figures 4A and 4B. (Yeast cells) Surrogate protein-protein interaction 
activates GAL gene expression. Figure 4A illustrates a small molecule other than 
galactose can be used to induce Gal4p mediated Gal promoter-mediated expression. 
Figure 4A illustrates FK506-dependent H/S3 reporter expression. Gal80-Fprl 
designates the GalSO-Fprlp fusion. Myr-Cnal designates the plasma membrane 

15 targeted Cnalp. Myr-Cmdl is a plasma membrane-targeted calmodulin and is used 
as a control. The nutrient agar plates used here lacked histidine, tryptophan and 
uracil and contained 0.5% galactose/3% glycerol/2% lactic acid as carbon source. 
FK506 was used at 0.25 ^ig/mL where indicated. Identical results were obtained 
using nutrient agar lacking galactose (data not shown). Figure 4B shows galactose- 

20 independent HIS3 reporter expression as a result of GalSO-Fprlp and Myr-Hom3p 
interaction. The nutrient agar plate lacked histidine, tryptophan and uracil and 
contained 3% glycerol/2% lactic acid as carbon source. Galactose was used at 
0.5% where indicated. The lack of tryptophan and uracil are required for selection 
and maintenance in yeast of the two plasmids GalSO-Fprlp and Myr-Cnal p, and the 

25 lack of histidine is to provide a selective assay for the protein-protein interaction that 
activates the HIS3 reporter gene carried oh a yeast chromosome. 

[043] Figure 5. (Yeast cells) A membrane-targeted transcription factor 
activates GAL gene expression through interaction with GalSO-TBP. Galactose- 



independent HISS reporter (expression as a result of Gal80-TBP and Myr-TAF145 
interaction; lane 1). Lane 2 illustrates the positive control: interaction between two 
proteins known to interact: Fprl (in fusion with (5al80) and Hom3 (fused to the Myr 
sequence and CFP). Lane 3 is the negative control with Gal80-TBP tested against 
5 just the Myr sequence. The nutrient agar plate used here contained 3% glycerol/2% 
lactic acid as carbon source and lacked histidine, tryptophan and uracil. 

[044] Figure 6. (Yeast cells) Membrane targeted transcriptional activation 
domain of Gal4p (Gal4AD; amino acid 758-881) activates HISS reporter gene 
expression. A yeast strain harboring a UASgal-HIS3 reporter was transformed with 

10 Myr-Gal4AD. The photomicrograph on the right shows the membrane localization of 
Myr-Gal4AD in yeast cells. Transformed cells were spotted on synthetic medium 
plates lacking histidine, tryptophan and uracil and containing 3% glycerol/2% lactic 
acid as carbon source. Cells showed evident growth in the absence of galactose, 
indicating an interaction between membrane-localized Gal4AD and GalSOp caused 

15 activation of the HISS reporter gene. As controls, cells carrying Gal3p exhibited 
growth only in the presence of galactose. 

[045] Figure 7. (Yeast cells) Addition of exogenous nuclear export sequence 
(NES) to GalSOp causes increased Gal4p-mediated reporter gene activation. Fusion 
of an NES to Gal80p increases Ga\Ap activation of a reporter gene in response to 

20 low levels of galactose. Top row, Pcaso-GalSO-NES. 2. Middle row, Pcaso-GalSO. 3. 
Bottom row, GalSO-deletion. The host yeast strain: Sc725 (ura3', trpl , his3", 
galBOA). Growth of yeast on agar lacking uracil and histidine and containing lOmM 
3-Azotriazole (to eliminate leaky HIS3 expression) and carbon sources 3% glycerol, 
2%lactic acid, 0.02% glucose and 0.1% galactose. Photo taken 7 days after 

25 spotting. 

[046] Figure 8. (Multicellular organisms or cells therefrom) GalSOGFP in cells 
from a multicellular organism (mouse 3T3). Note that GalSOGFP is in both the 
nucleus and the cytosol, but is excluded from the nucleoli. Also note size of nuclei is 
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same as in case of Figure 9 wiiere perimeter of nuclei is more defined due to 
exclusion of fluorescent Gal3GFP. Imaging was performed with an Olympus Confocal 
Scanning Microscope using a lOOX objective. 

[047] Figure 9. (Multicellular organisms or cells therefrom) GalSGFP in cells 
5 from a multicellular organism (mouse 3T3). Note that the nuclei are devoid of . 
GalSGFP. Imaging was performed with an Olympus Confocal Scanning Microscope 
using a lOOx objective. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

10 

[048] All references, patents and patent applications are hereby incorporated by 
reference in their entirety. 

[049] According to the teachings of this specification, unless otherwise stated, 
the techniques utilized may be found in any of several references known in the art, 

15 including but not limited to: Molecular Cloning: A Laboratory Manual, 3"^ ed. 
(Sambrook eta/., 2001, Cold Spring Harbor Laboratory Press: New York); Gene 
Expression Technp/ogy (Methods in Enzymology, Vol. 185, Goeddel, ed.. Academic 
Press, San Diego, OA, 1991); "Guide to Protein Purification" in Methods in 
Enzymology (Deutshcer, ed., (1990) Academic Press, Inc.); PCR Protocols: A Guide 

20 to Methods and Applications (Innis et a/., 1990, Academic Press, San Diego, CA); 
Culture of Animal Cells: A Manual of Basic Technique, 2"^ Ed. (Freshney, 1987, Liss, 
Inc. New York, NY); Gene Transfer and Expression Protocols, pp. 109-128, Murray, 
ed., The Humana Press Inc., Clifton, N.J.), and the Promega 1996 Protocols and 
Applications Guide, S""^ Ed. (Promega, Madison, Wl). 

25 [050] This invention provides methods useful in multicellular organisms or cells 
therefrom for producing regulatable gene expression, detecting and analyzing 
protein sequences, and detecting and analyzing protein-protein interactions and 
agonists and antagonists thereof in multicellular organisms or cells therefrom. In 
particular, methods for detecting protein-protein interactions are provided that do 



not suffer from the limitations of existing methods used for multicellular organisms 
or cells therefrom referred to above. 

[051] The present invention provides methods based on previously 
unrecognized features and characteristics of the Gal3p-Gal80p-Ga|4p gene switch of 

5 the yeast Sacchardmyces cerevisiae. Specifically, prior to the instant invention it 
was unknown whether the yeast GAL gene switch could be used in multicellular 
organisms or cells therefrom to provide regulatable, in contrast to constitutive, gene 
expression. More specifically, it was not known in multicellular organisms or cells 
therefrom whether; i) GalSOp could shuttle between cytoplasm and the nucleus and, 

10 ii) Gal3p is restricted to the cytoplasm. These features are necessary for the 

"native" GAL switch to work in multicellular organisms or cells therefrom as it does in 
yeast. The realization that these two features of the GAL gene switch mechanism 
can be realized in multicellular organisms or cells therefrom is provided herein. As 
disclosed in more detail herein, GalSOp can shuttle between cytoplasm and the 

15 nucleus and Gal3p is restricted to the cytoplasm in cells from a multicellular 
organism. Thus, the Gal3p-Gal80p-Gal4p gene switch can be used in multicellular 
organisms or cells therefrom to provide methods for regulatable gene expression, 
detecting and analyzing protein sequences, and detecting and analyzing protein- 
protein interactions and agonists and antagonists thereof in multicellular organisms 

20 or cells therefrom. Also provided herein are Gal80-Trap-like systems utilizing other 
proteins to accomplish the same investigatory roles in multicellular organisms or 
cells therefrom. 

[052] As used herein, the plurality of methods provided by this invention are 
collectively termed M-GAL. The invention provides methods based on a GAL gene 
25 switch for detecting and analyzing protein-protein interactions, nuclear export and 
nuclear import sequences and galactose-independent and galactose-dependent 
inducibility of Gal4p-mediated gene expression in multicellular organisms or cells 
therefrom. M-GAL embodies some basic principles of the Y-GAL gene switch as 



described in co-owned and co-pending U.S. Patent Application Serial No. 
10/165,873. Thus, altliough the M-GAL methods are based on the Gal3p-Gal80p- 
Gal4p gene switch of yeast, M-GAL is a system based entirely in multicellular 
organisms or cells therefrom, M-GAL can comprise the Gal3p-Gal80p-Gal4p gene 

5 switch elements designed into a platform of multicellular organisms or cells 
therefrom, but, importantly, the M-GAL methods can be designed to operate 
analogously to native Y-GAL and thus comprise other proteins that serve the same 
biological purpose of the Gal3p-Gal80p-Gal4p gene switch proteins. 

[053] The present invention provides methods for detecting and analyzing 

10 protein-protein interactions and agonists and antagonists thereof, detecting and 
analyzing protein sequences, and regulatable gene expression in multicellular 
organisms or cells therefrom, as well as reagents for performing said methods in 
multicellular organisms or cells therefrom. 

[054] This invention is based, in part, on the discovery that GalBOp, which is an 

15 inhibitor of the transcriptional activator Gal4p, can be prevented from inhibiting 
Gal4p-mediated transcriptional activation of GAL gene promoters when GalSOp is 
preferentially sequestered or trapped in the cytoplasm. GalSOp binds with high 
affinity to a specific portion of the Gal4p transcriptional activation domain, and 
blocks Ga|4p's ability to recruit RNA polymerase II at UASg/u. sites contained in GAL 

20 gene promoters. In native yeast, Gal80p binding within the Gal4p transcriptional 
activation domain is relieved by the interaction between Gal3p and GalSOp, an 
interaction that occurs only in the presence of galactose. Normally GalSOp shuttles 
dynamically between the yeast nucleus and cytoplasm, and under steady state 
conditions in the absence of galactose a sufficiently high concentration of GalSOp is 

25 in the nucleus to bind to Gal4p and keep Gal4p inactive and the GAL regulon 
transcriptionally silent. However, in the presence of galactose, GalSOp, binds to 
Gal3p molecules that are normally confined to the cytoplasm in native yeast cells, 
resulting in insufficient GalSOp in the nucleus to inhibit Gal4p. Consequently, Gal4p is 
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able to activate genes of the galactose regulon, as well as reporter genes expressed 
via UASg/\l promoters. We discovered that the principles of the Gal80-Trap system 
could translate to other interacting proteins, i.e., proteins other than Gal80p and 
Gal3p. and, unexpectedly, that these systems could translate to more complex 
5 systems than yeast, namely multicellular organisms and cells therefrom. Thus, 
wheriever the Gal80-Trap system is described herein, even when specifically 
describing, for example, GalSOp, Gal3p and galactose, the principles apply to 
analogous GalBO-Trap-like systems that comprise interacting proteins that 
correspond to the GalBOp transcriptional repressor, the Gal3p binding partner (or 

10 "second protein") and the galactose effector of protein-protein interactions, the 
actions of which can combine to influence the transcription of a promoter in the 
multicellular organisms or cells therefrom. Preferably, the transcribed protein is 
detectable, thus enabling the indirect detection of protein-protein interactions. 
[055] To elaborate, the methods provided by this invention can be used with any 

15 transcriptional regulatory factors wherein the expression of a gene is mediated by a 
transcriptional activator that is in turn regulated by a nucleocytoplasmic 
transcriptional inhibitor. In the practice of the inventive methods, the known ("bait") 
or unknown ("prey") proteins or protein-binding fragments can be fused to any 
protein that functions as both a nucleocytoplasmic shuttling protein and an inhibitor 

20 or repressor of transcriptional activation. Localization of the transcriptional 

inhibitor/represser in the nucleus prevents transcriptional activation by inhibiting the 
transcriptional activator, repressing recruitment of RNA polymerase, or other means. 
Conversely, localization of the transcriptional inhibitor in the cytoplasm permits 
transcriptional activator-mediated gene expression. Proteins or protein-binding 

25 fragments that interact with the protein or protein-binding fragment that is fused to 
the transcriptional inhibitor are targeted to the cytoplasm, preferably by fusion to a 
sequence specific for cytoplasmic localization, such as a multicellular organisrrv 
based or synthetic cytoplasmic membrane localization sequence. Protein-protein 



interactions between the bait and prey proteins results in an imbalance in the l<inetics 
or stoichiometry of nuclear/cytoplasmic partitioning of the transcription inhibitor in 
favor of the cytoplasm, thereby releasing transcription inhibition and increasing 
transcription of genes regulated by promoters sensitive to the inhibitor. As 
5 described above, the present invention comprises bait-bait, prey-prey, prey-bait and 
bait-prey couplings, depending on whether the proteins or protein-binding fragments 
fused to the transcription inhibitor and membrane targeting sequence are known, 
i.e., bait or unknown, i.e., prey. In some embodiments, genes having increased 
transcription as a consequence of cytoplasmic localization of the transcription 

10 inhibitor are endogenous cellular genes or reporter genes operably linked to an 
endogenous or exogenous promoter sensitive to the inhibitor. In other related 
embodiments, genes having increased transcription as a consequence of 
cytoplasmic localization of the transcription inhibitor are exogenous cellular genes or 
reporter genes operably linked to an endogenous or exogenous promoter sensitive 

15 to the inhibitor. The promoter sensitivity is preferably mediated by the promoter 
being dependent on a transcription activator whose transcriptional activating activity 
is inhibited by the transcription inhibitor. The methods of the invention are provided 
to be used in the forward direction, i.e. detection of gene expression in the presence 
of protein-protein interactions, or the reverse direction, i.e, detection of loss of gene 

20 expression when said protein-protein interaction is inhibited or disrupted. 

[056] The invention provides a method for detecting protein-protein interactions 
in a host cell cytoplasm, the method comprising the steps of introducing into a cell a 
gene, or coding sequence (for example cDNA), expressing a first protein or protein- 
binding fragment thereof fused with the amino- or carboxyl- terminus of a 

25 transcriptional inhibitor; introducing into said cell a gene, or coding sequence (for 
example cDNA), expressing a second protein or protein-binding fragment thereof 
fused to a cytoplasm localization sequence, wherein upon interaction of the first and 
second proteins in the cell cytoplasm, said transcriptional inhibitor is preferentially 
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localized to the cytoplasm, wherein transcription of a gene operably linked to a 
promoter that is sensitive to or regulated by said transcriptional inhibitor is 
increased; and wherein the protein-protein interaction is detected by detecting said 
increased transcription of said gene. 

5 [057] In preferred embodiments, the cytoplasm localization sequence is a 
membrane targeting sequence, preferably a multicellular organism-based or 
synthetic cell membrane targeting sequence or a mitochondrial membrane targeting 
sequence. Preferably, the cell is a cell from a multicellular organism. In certain 
embodiments, the gene, or coding sequence (for example cDNA), is operably linked 

10 to a promoter that is sensitive to or regulated by said transcriptional inhibitor and is 
an exogenous reporter gene, most preferably encoding a detectable product, such 
as a product that produces a detectable metabolite or detectable phenotype (e.g., a 
growth advantage or disadvantage). 

[058] As used herein, the term "gene" can comprise a native genetic sequence 

15 or other natural or synthetic, processed or unprocessed (i.e., a gene including exons 
and introns or a cDNA), coding or non-coding sequence. 

[059] As used herein, the term "operably linked" refers to components such as 
genetic components that are in a relationship, specifically covalently linked to one 
another, permitting them to function in their intended manner. For example, a 

20 control sequence "operably linked" to a protein coding sequence is ligated thereto so 
that expression of the protein coding sequence is achieved under conditions 
compatible with the transcriptional activity of the control sequences. In particular, a 
transcriptionally-regulated promoter is covalently linked in an orientation, typically 5' 
to an open reading frame encoding an amino acid sequence of a protein, such as a 

25 reporter gene or a fusion protein of the invention, so that transcription of the coding 
sequence is appropriately regulated. 

[060] As used herein, the terms "fused" and "fusion protein" are intended to 
mean that the sequences, such as the amino acid sequences of the protein, are 

30 



covalently linked to one another. When used with regard to nucleic acids encoding a 
fusion protein, the term refers to covalent linkage of the sequences to be properly 
"in frame," so that translation of a mRNA encoding the fusion protein properly 
translates the individual components thereof in the proper reading frame of the 

5 genetic code. 

[061] As used herein, the term "protein binding fragments" is intended to 
encompass fragnients of a protein involved in a protein-protein interaction that 
produces or is capable of producing a detectable signal according to the methods of 
the invention. Said fragments are capable of mediating the protein-protein 

10 interaction and are therefore necessary components of the fusion proteins of the 
invention. It will be recognized that other portions of the native protein involved in 
the protein-protein interactions assayed by the inventive methods may contribute to 
the interaction, but at least the portions termed "protein binding fragments" are 
required for the protein-protein interaction to take place. 

15 [062] As used herein, the term "bait" refers to a known protein or protein- 
binding fragment, or coding sequence therefor. Bait can be fused to a 
transcriptional inhibitor, for example GalSOp, or other sequences, for example 
multicellular organism-based or synthetic membrane targeting sequences. Herein, 
bait can bind to other known proteins or protein-binding fragments (i.e., other bait), 

20 or to unknown proteins or protein-binding fragments (i.e., prey). Thus, the present 
invention comprises bait-bait, bait-prey, prey-bait and prey-prey interactions. 

[063] As used herein, the term "prey" refers to an unknown protein or protein- 
binding fragment or coding sequence therefore. Prey can be fused to a 
transcriptional inhibitor, for example GalSOp, or other sequences, for example 

25 multicellular organism-based or synthetic membrane targeting sequences. Herein, 
prey can bind to other known proteins or protein-binding fragments {i.e., bait), or to 
other unknown proteins or protein-binding fragments (i.e., other prey). Thus, the 
present invention comprises bait-bait, bait-prey, prey-bait and prey-prey interactions. 



[064] In certain embodiments, tlie gene operably linked to a promoter that is 
sensitive to or regulated by the transcriptional inhibitor to which a protein or protein- 
binding fragment is fused is a selectable gene, wherein increased transcription of 
said selectable gene permits growth of the cell in selective conditions. In these 
5 embodiments, the inventive methods further comprise subjecting the host cell to 
selective growth conditions, and detecting increased growth or survival of said cells 
under selective growth conditions, wherein said protein-protein interaction is 
detected thereby. 

[065] In certain embodiments, a known (bait) or unknown (prey) protein or 

10 protein-binding fragment is fused to any native transcriptional inhibitor from a cell 
from a multicellular organism to create a fusion protein. Said fusion protein can also 
comprise additional sequences, for example, a myristoylation or mitochondrial outer 
membrane sequence from a multicellular organism. Alternatively, where a promoter 
that is responsive to the native Gal80-Trap system components is present in the cell 

15 from a multicellular organism (i.e„ a GALuAs-sequence containing promoter is 
exogenously introduced or introduced using standard genetic techniques), and 
optionally operatively-coupled to a gene than encodes a protein whose transcription 
is detectable, GalSOp can serve as the transcriptional inhibitor to which a bait protein 
or protein binding fragment can be fused at either the GalSOp amino- or carboxyl- 

20 terminus. The binding partner of the above-mentioned known (bait) or unknown 
(prey) protein or protein-binding fragment fused to any native transcriptional inhibitor 
from a cell from a multicellular organism to create a fusion protein, is either another 
known protein or protein-binding fragment or unknown protein or protein-binding 
fragment, for example, one of a multiplicity of unidentified potential partners 

25 encoded in a cDNA library. This binding partner is tethered in the cytoplasm, 
particularly to a membrane via fusion to a membrane targeting sequences, for 
example, a multicellular organism-based or synthetic myristoylation sequence or one 
of a multiplicity of plasma membrane, vesicular membrane or mitochondrial outer 



membrane targeting sequences known in the art. If the bait-bait, bait-prey, prey-bait 
or prey-prey partners interact in the cytoplasm, the transcriptional inhibitor is 
preferentially partitioned or trapped in the cytoplasm and prevented from inhibiting 
transcriptional activation of its corresponding promoter(s) in the nucleus. 

5 [066] In additional embodiments, recombinant molecules are provided 
comprising the Gal4p transcription activation domain fused to the DNA-bihding 
domain of other endogenous or exogenous DNA site-specific binding proteins. Using 
such constructs, reduced levels of GalSOp in the nucleus activate genes whose 
promoters contain the cognate binding site for the DNA-specific binding protein 

10 domain fused to the Gal4p transcription activation domain. 

[067] As described above, the methods and reagents of the invention are 
designed to allow bait and prey fusions in either direction, i.e., wherein the "bait" 
protein or specific binding fragment thereof is fused to the transcriptional repressor 
protein (e.g., GalSOp) and the "prey" protein or specific binding fragment thereof is 

15 specifically localized and restricted to the cytoplasm, or vice versa. The following 
embodiments are provided as examples. Bait fusions can be made with known 
endogenous or exogenous transcription inhibitors, for example GalSOp (such a 
system is termed the "B-80 version"), or bait fusion can be made with multicellular 
organism-based or synthetic membrane targeting sequences, for example, a 

20 myristoylation signal or a mitochondria outer membrane (MOM) targeting signal (B- 
Myr or B-MOM versions), or any other sequence that specifically localizes or restricts 
the prey to the cytoplasm. The latter two embodiments have the advantage that 
libraries (e.g., cDNA libraries), wherein the prey proteins or protein-binding fragments 
thereof are fused to the transcription inhibitor (to either the N-terminus or C- 

25 terminus), can be used with either the B-Myr or B-MOM versions of the desired bait. 
[068] As an example of the B-80 version of the methods and reagents of the 
invention, the protein of interest is fused with the Gal4p inhibitor, GalSOp. GalSOp 
functions as an inhibitor of Gal4p transcriptional activator and this property is 
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generally retained when other protein sequences are fused to either N- or C-terminus 
of GalSOp. This retention of native activity advantageously increases the likelihood 
that bait proteins fused to GalSOp not interfere with GalSOp's ability to inactivate 
Gal4p transcription activation activity. The invention also provides libraries of either 
5 N-terminal or C-terminal fusions, wherein the fusion protein comprises a plurality of 
fusion proteins in which the transcription inhibitor protein, (such as GalSOp) is fused 
to each of a plurality of members of said cDNA library in each species of fusion 
protein comprising said plurality. In these embodiments, the likelihood that bait 
proteins fused to GalSOp (in this example; to other transcriptional inhibitors in other 
10 analogous systems) will fold in a manner that preserves native interacting surfaces is 
optimized. 

[069] For a protein or protein-binding fragment to be tested or in alternative 
embodiments, wherein the GalSOp fusion is a particular protein of interest and the 
cytoplasmically-restricted fusion protein comprises a plurality of fusion proteins in 

15 which the cytoplasmic localization sequence is fused to each of a plurality of 
hiembers of said cDNA library in each species of fusion protein comprising said 
plurality, for cDNA libraries to be screened, cDNAs are preferably cloned into 
vectors encoding a multicellular organism-based or synthetic myristoylation signal or 
mitochondria outer membrane-targeting signal. The ability to target proteins to 

20 different cellular environments (e.g., the plasma membrane or mitochondrial outer 
membrane) provides utility for assays involving a protein that may require the 
particular environment of the cytoplasm or the mitochondrion or the environment 
proximal to a cellular membrane in the cytoplasm to be fully active and participate in 
its native interactions. As provided by the invention, such constructs can also 

25 advantageously comprise a green fluorescent protein (GFP) tag, or other known tag, 
for convenient verification of protein expression and targeting. 

[070] In particular embodiments, the invention provides methods for detecting 
protein-protein interactions that occur in a host cell cytoplasm comprising the steps 
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of introducing into a cell from a multicellular organism a gene expressing a first 
protein or protein-binding fragment tliereof fused with tlie amino- or carboxyl- 
terminus of GalSOp; introducing into said host cell a gene expressing a second 
protein or protein-binding fragment thereof fused to a multicellular organism-based 

5 or synthetic cytoplasm localization sequence, wherein upon interaction of the first 
and second proteins in the cell cytoplasm, said GalSOp fusion protein is localized to 
the cytoplasm, wherein transcription of a gene operably linked to a promoter that is 
sensitive to or regulated by said transcriptional inhibitor is increased; and detecting 
said increased transcription of said gene wherein said protein-protein interaction is 

10 detected thereby. 

[071] In related embodiments, the cytoplasm localization sequence is a 
membrane targeting sequence, preferably a cell membrane targeting sequence or a 
mitochondrial membrane targeting sequence. Preferably, the cell is a cell from a 
multicellular organism. In certain embodiments, the gene operably linked to a 

15 prpmoter that is sensitive to or regulated by GalSOp is an exogenous reporter gene, 
most preferably encoding a detectable product or a product that produces a 
detectable metabolite. 

[072] In certain related GalSOp-based embodiments, the increased gene 
expression of the gene operably linked to a promoter that is sensitive to or regulated 

20 by GalSOp is a selectable gene, wherein said increased transcription permits growth 
of the cell in selective conditions. In these embodiments, the inventive methods 
further comprise subjecting the host cell to selective growth conditions, and 
detecting increased growth or survival of said cells under selective growth 
conditions, wherein said protein-protein interaction is detected thereby. 

25 [073] In alternative embodiments, the inventive methods provide a method for 
isolating a protein that is a member of an interacting protein pair, or a nucleic acid 
encoding said protein, the method further comprising the steps of isolating the 
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encoding sequence, for example, cDNA sequence, for said second protein and 
optionally expressing said protein in a host cell.. 

[074] In preferred embodiments, the first or second protein or protein-binding 
fragment can be any protein that specifically interacts or, when probing a library for 

5 interacting proteins or protein-binding fragments, putatsvely interacts with another 
protein in a cell. In certain preferred embodiments, the first protein includes but is 
not limited to transcriptional proteins or fusions to transcriptional proteins. As used 
herein the term "transcriptional protein" includes but is not limited to any protein that 
is involved with or a component of the transcriptional machinery of the cell, including 

10 transcriptional activators, transcriptional repressors, RNA polymerase II associated 
proteins, and chromatin associated proteins. In an additional embodiment, the first 
protein is encoded by a member of a cDNA library, wherein said library comprises a 
plurality of fusion proteins in which the transcription inhibitor protein (such as GalSOp) 
is fused to each of a plurality of members of said cDNA library in each species of 

15 fusion protein comprising said plurality. 

[075] In further embodiments, the protein or protein-binding fragment "binding 
partner" (so labeled here to avoid confusion with the "first" or "second" proteins and 
protein-binding fragments that include or are fused to a transcriptional protein) to the 
first or second protein or protein-binding fragment described in the above paragraph 

20 is a fusion with a cytoplasmic localization sequence, preferably encoded by a cloned 
cDNA. This binding partner can also be encoded by a member of a cDNA library, 
wherein the fusion protein comprises a plurality of fusion proteins in which the 
cytoplasmic localization sequence is fused to each of a plurality of members of said 
cDNA library in each species of fusion protein comprising said plurality. In additional 

25 preferred embodiments, this binding partner is fused to the C- or N-terminus of a 
membrane targeting sequence, for example, a multicellular organism-based or 
synthetic membrane targeting sequence. The term "membrane targeting sequence" 
is intended to encompass DNA sequences that result in membrane 
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localization/anchorage of translated proteins. Membranes in this aspect of the 
invention include but are not limited to the cell membrane, mitochondrial outer 
membrane, nuclear membrane, vacuolar membranes, vesicular membranes, 
membranes of the endoplasmic reticulum, membranes of the Golgi apparatus, and 

5 all other organelle membranes. In a preferred embodiment, the binding partner is 
localized to the cellular membrane via fusion to a myristoylation sequence. In an 
additional preferred embodiment, the binding partner is localized to the outer 
mitochondrial membrane via fusion to a Tom70/Mas70 sequence or multicellular 
species-specific homolog thereof, or any other mitochondrial inner membrane 

10 protein or outer membrane protein that protrudes into the cytoplasm. 

[076] The invention also provides host cells harboring expression constructs 
described herein. "Expression constructs" and "recombinant expression constructs" 
will be understood to be genetically-engineered nucleic acid sequences encoding at a 
minimum an origin of replication, a selectable marker and a gene comprising a 

15 polypeptide-encoding nucleic acid that Is expressed in a recipient host cell. In 
preferred embodiments, recombinant expression constructs can comprise coding 
sequences for the bait and prey protein-transcription factor fusions as described 
herein and a coding sequence that encodes a detectable product or causes a 
detectable phenotype. In other embodiments, cDNAs and galactose regulon 

20 components described above can be used in the construction of recombinant 
expression constructs. In additional embodiments, cDNAs and glucose regulon 
components described above can be used in the construction of recombinant 
expression constructs. In certain aspects, the recombinant expression constructs of 
the invention most preferably comprise coding sequences that encode a 

25 transcription inhibitor. The recombinant expression constructs of the invention can 
encode transcription factors derived from the galactose regulon, preferably Gal80p, 
and the glucose regulon, for example, Miglp and Mig2p. The recombinant 
expression constructs also comprise protein-encoding cDNAs fused to the amino- or 



carboxyl-terminal sequence of a transcription inhibitor of choice, for example, 
GalSOp (galactose regulon) or Miglp (glucose regulon). In a further embodiment, the 
recombinant expression constructs comprise protein-encoding cDNAs fused to 
membrane targeting peptides. In additional embodiments, the recombinant 
5 expression constructs encode membrane targeting signal peptides fused to protein- 
encoding cDNAs or a plurality of protein-encoding cDNAs in one or more cDNA 
libraries. In a preferred embodiment, the expression constructs further encode a 
myristoylation sequence or a membrane targeting sequence, for example, 
Tom70/Mas70 for yeast, or known membrane targeting sequence applicable to the 

10 multicellular species of interest, or of synthetic origin. 

[077] In a further embodiment, the selectable growth conditions are chosen to 
select for multicellular organisms or cells therefrom comprising a selectable reporter 
gene wherein increased expression thereof confers a growth advantage on the cell 
or distinguishes the cell in a detectable manner. In these embodiments, protein- 

15 protein interactions are determined by expression of selectable or detectable 
reporter genes, or subsequent growth of the host cell arising from expression of a 
selectable reporter gene, or a combination thereof. In this embodiment, protein- 
protein interaction permits transcriptional activation and expression of a selectable 
or detectable reporter gene, or subsequent growth of the host cell arising from 

20 expression of a selectable reporter gene, or a combination thereof. In the practice 
of one aspect of this embodiment of the invention, the selectable reporter gene is 
expressed to high enough expression levels to provide sufficient production of the 
gene product to permit cell growth under selectable conditions. 

[078] In additional related embodiments, the selectable reporter gene encodes 

25 a product that converts a non-toxic compound to a cytotoxic or cytostatic 

compound. In such embodiments, decreased expression of said gene is detected 
by cell growth or survival; whereas, increased expression of said gene is detected 
by Cell death. 
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[079] In a further embodiment, the invention provides methods for detecting 
protein-protein interactions by expression of a detectable or selectable reporter 
gene. A detectable reporter gene as used in these methods is any that is 
detectable, for example visibly, by the presence of the gene product. Examples of 

5 selectable reporter genes include but are not limited to green fluorescence protein 
(GFP), cyan fluorescence protein (CFP), yellow fluorescent protein (YFP), the red 
fluorescent protein DsRed, beta-glucoronidase, luciferase, and lad (encoding beta- 
galactosidase). A selectable reporter gene as used in these methods is any that can 
be used to, for example, enable or prevent the growth of a cell expressing said 

10 gene. Examples of selectable reporter genes include but are not limited to HISS, 
ADE2, ADE5, ADE7, LYS2, LEU2, URA3, TRPl, KAN, NEO (neomycin resistance for 
growth in (5418), growth on hypoxanthine-aminopterin-thymidine (HAT) media for 
thymidine kinase, growth in the presence of pyrimidine analogs such as 
bromodeoxyuridine (BrdU) for thymidine kinase activity, and growth in methotrexate 

15 for dihydrofolate reductase (DHFR). The detectable reporter gene or genes utilized 
may not be the same gene or genes utilized as the selectable reporter gene {i.e., a 
gene required for growth). In another embodiment of this aspect of the inventive 
methods, protein-protein interaction is detected by reporter gene activity. Reporter 
gene activity may be determined by growth (i.e., using a selection protocol), or 

20 biochemical activity, or a biophysical signal such as fluorescence, photon emission, 
change in color spectrum, transfer of radioactive groups, or by binding to an 
antibody and detected either directly or indirectly, for example, by conjugation to a 
detectable marker such as horseradish peroxidase or a fluorescent agent. 
[080] The expression of all selectable or detectable reporter genes is 

25 dependent on the activation of upstream promoters. Upstream promoters have 
required DNA sequence(s) for binding transcriptional activator(s) or a site-specific 
DNA-binding domain that is fused to a transcription activation domain, for example, 
the Gal4p transcriptional activation domain and, optionally, for binding a 
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transcriptional inhibitor directly. Transcription of selectable or detectable reporter 
genes is dependent on binding a transcription activator protein, for example Gal4p 
or its transcription activation domain to the promoter and the absence of binding of 
a transcription Inhibitor, for example GalSOp, either to a binding domain on the 
5 transcriptional activator protein or on the DNA itself. Thus, when the promoter has 
transcriptional activating activity and a less than inhibitory amount of inhibition 
activity {e.g., when a transcriptional inhibitor is sequestered in the cytoplasm by a 
protein anchored to a membrane), the selectable or detectable gene can be 
transcribed. Growth under the appropriate selection conditions for the selectable 

10 marker or detection of a detectable marker demonstrates protein-protein 

interactions occurred between the prey and bait proteins. For example, where the 
native Gal4p is used, the expression of all selectable or detectable reporter genes is 
dependent on the activation of upstream GAL gene promoters having a UASgal DNA 
sequence by binding Gal4p to the promoter and sequestration of the GalSOp 

15 transcription inhibitor in the cytoplasm. 

[081] The term "GAL gene promoter" is intended to encompass any regulatable 
promoter isolated from any of the genes of the yeast galactose regulon including but 
not limited to GALl, GAL7, GALIO, GAL2, GAL5, MELl and other native GAL gene 
promoters indigenous to yeast as well as all artificial, man-made galactose- 

20 responsive promoters that are under the control of the GAL gene promoter specific 
UASgal sequences and the UASG/(L-specific transcriptional activator, Gal4p or 
derivatives of Gal4p that retain the DNA binding domain and the GalSOp-binding 
domain of Gal4p. Alternatively, Gal4-activatable promoters having synthetic UASgal 
sites can be utilized to drive induced expression of polypeptides. 

25 [082] The term "detectable" refers herein to the ability to identify, measure, or 
localize a protein product, either antigenically, immunologically or enzymatically 
within the host cell or cell extracts. Protein products can further comprise additional 
sequences useful for promoting identification or purification of the protein, such as 
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epitope or fluorescent tags. Examples of such epitope and fluorescent tags include, 
but are not limited to FLAG (Sigma Chemical, St. Louis, MO), myc (9E10) (Invitrogen, 
Carlsbad, CA), 6-His (Invitrogen; Novagen, Madison, Wl), HA (Boehringer Manheim 
Biochemicals), green fluorescent protein (GFP), cyan fluorescent protein (CFP), and 

5 the red fluorescent protein DsRed. In one embodiment, polypeptide sequences, or 
fragments thereof, are operatively linked to a nucleic acid sequence encoding an 
"epitope tag", so that the protein is expressed with an epitope tag. The epitope tag 
may be expressed as the amino terminus, the carboxyl terminus, or internal to any of 
the polypeptide chains comprising any of said first protein second protein, or GalSOp 

10 so long as the resulting protein remains functional and is able to interact with other 
proteins. 

[083] The term "isolating" refers to any methods known in the art for isolating or 
purifying DNA. In the practice of certain aspects of this invention, plasmids 
containing cDNAs of interest are isolated from multicellular organisms or cells 

15 therefrom and transformed into £. coli for subsequent amplification, purification, and 
characterization. (Ward, 1990 Nucleic Acids Research 1§: 5319). Such methods for 
isolation include but are not limited to PCR amplification, SDS-alkaline lysis, lithium 
chloride/Triton® X-100, resin-based isolation, and introduction into £. coli, restriction 
digestion, any other methods kriown in the art or any combination thereof. 

20 [084] The term "selectable gene" generally refers to a coding sequence that 
encodes a product, the presence or absence of which can be detected. For 
example, increased expression of said selectable gene can confer a growth 
advantage or a growth disadvantage on the cell or can distinguish the cell in some 
detectable manner. Likewise, decreased expression of said selectable gene can 

25 confer a growth advantage or a growth disadvantage on the cell or can distinguish 
the cell in some detectable manner. Thus, as used herein, said selectable genes 
can be useful in positive as well as negative selection method, as known in the art. 
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[085] The invention also provides methods for detecting in a host cell cytoplasm 
a nuclear export sequence (NES), the method comprising the steps of introducing 
into the cell a recombinant expression construct encoding a protein or protein- 
binding fragment containing a putative NES fused with the amino- or carboxyl- 
5 terminus of a transcription inhibitor; assaying the cell for expression of a gene 
operably linked to a promoter that is sensitive to or regulated by said transcriptional 
inhibitor; and detecting increased expression of said gene, wherein an NES is 
detected thereby. 

[086] In certain aspects of the invention, the transcription inhibitor is any 

10 endogenous transcription factor from the multicellular organism or cell therefrom, 
preferably transcription inhibitor, or exogenous transcription factor, also preferably a 
transcription inhibitor. In particular embodiments, the transcriptional inhibitor is 
GalBOp. In a preferred embodiment, the recombinant expression construct encodes 
a cDNA or a member of a cDNA library, wherein said library comprises a plurality of 

15 fusion proteins in which the transcription inhibitor protein (such as GalSOp) is fused 
to each of a plurality of members of said cDNA library in each species of fusion 
protein comprising said plurality. Also preferred are methods wherein the gene, or 
coding sequence thereof, that is operably linked to a promoter that is sensitive to or 
regulated by said transcriptional inhibitor is a selectable gene, wherein increased 

20 expression of said gene confers a growth advantage on the cell. In these 

embodiments, the inventive methods further comprise subjecting the host cell to 
selective growth conditions, and detecting increased growth or survival of said cells 
under selective growth conditions, wherein said protein-protein interaction is 
detected thereby. In another embodiment, the gene or coding sequence thereof that 

25 is operably linked to a promoter that is sensitive to or regulated by a transcriptional 
inhibitor encodes a detectable protein product. 

[087] The invention also provides methods for isolating NES, the method 
comprising detecting increased expression of a gene operably linked to a promoter 



that is sensitive to or regulated by said transcriptional inhibitor in a cell comprising a 
recombinant expression construct encoding a protein or protein-binding fragment 
containing a putative NES fused with the amino- or carboxyl-terminus of a 
transcription inhibitor; and isolating said recombinant expression construct 
5 comprising said NES from the cell. 

[088] Also preferred are embodiments wherein gene operably linked to a 
promoter that is sensitive to or regulated by a transcriptional inhibitor is an 
exogenous reporter gene encoding a detectable product. In further preferred 
embodiments, the gene operably linked to a promoter that is sensitive to or 

10 regulated by a transcriptional inhibitor is a selectable gene, wherein increased 
expression of said gene confers a growth advantage on the cell. In these 
embodiments, the method further comprises the steps of subjecting the host cell to 
selective growth conditions, and detecting increased growth or survival of said cells 
under selective growth conditions, wherein said NES is detected thereby. 

15 [089] The NES can be isolated from a recombinant expression construct 
comprising a cell expressing a gene operably linked to a promoter that is sensitive 
to or regulated by said transcriptional inhibitor. 

[090] In an additional preferred embodiment, the invention includes methods for 
detecting proteins or protein-binding fragments thereof containing nuclear export 

20 sequence (NES). A cDNAs or plurality thereof that encodes a protein or proteins 
containing an NES sequence are fused to a transcriptional inhibitor, for example, 
GalSOp. The presence of an exogenous nuclear export sequence in said protein 
results in sequestration of the transcription inhibitor in the cytoplasm and subsequent 
activation of one or more promoters that correspond or are sensitive to the 

25 transcription inhibitor. 

[091] The invention also provides a method for detecting a nuclear localization 
sequence (NLS) in a host cell comprising introducing into the cell a recombinant 
expression construct encoding a protein or protein-binding fragment containing a 
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putative NLS, fused with the amino- or carboxyl-terminus of a transcription inhibitor; 
assaying the cell for expression of a gene operably linked to a promoter that is 
sensitive to or regulated by said transcriptional inhibitor; and detecting decreased 
expression of said gene, wherein an NLS is detected thereby, 
5 [092] In a preferred embodiment, the recombinant expression construct 
encodes a cDNA or a member of a cDNA library, wherein the recombinant 
expression construct encodes a fusion protein and wherein the library thereof 
encodes a plurality of fusion proteins in which the transcription inhibitor protein (such 
as GalSOp) is fused to each of a plurality of members of said cDNA library in each 

10 species of fusion protein comprising said plurality, wherein certain of the members 
of the cDNA species comprise an NLS. Also preferred are methods wherein the 
gene that is operably linked to a promoter to or regulated by said transcriptional 
inhibitor encodes a product that converts a non-toxic compound to a cytotoxic or 
cytostatic compound. In additional preferred embodiments, expression of the gene 

15 that is operably linked to a promoter to or regulated by said transcriptional inhibitor 
is inducible when the cell Is contacted with an inducing agent In preferred 
embodiments, the gene that is operably linked to a promoter sensitive to or 
regulated by said transcriptional inhibitor encodes a product that converts a non- 
toxic compound to a cytotoxic or cytostatic compound and wherein the expression 

20 of the gene operably linked to a promoter that is sensitive to or regulated by said 
transcriptional inhibitor is inducible when the cell is contacted with an Inducing agent. 
In such embodiments, decreased expression of said gene is detected by cell growth 
or survival when the cell is contacted with both the inducing agent and the cytotoxic 
or cytostatic compound. 

25 [093] The invention also provides methods for isolating NLS, the method 

comprising the steps of detecting decreased expression of the gene operably linked 
to a promoter that is sensitive to or regulated by said transcriptional inhibitor and 
encoded by a recombinant expression construct encoding a NLS-containing protein 



or NLS-containing fragment thereof fused with the arriino- or carboxyl-terminus of a 
transcription inhibitor; and isolating said recombinant expression construct 
comprising said NLS from the cell. The invention also provides methods for isolating 
NLS, the method comprising the steps of transfecting 

5 [094] In a further embodiment, the invention provides a method for detecting 
proteins or fragments thereof containing one or more nuclear localization sequence. 
In the practice of this aspect of the invention, cDNAs encoding proteins containing a 
putative NLS are fused to a transcription inhibitor such as, for example, GalSOp. The 
presence of an exogenous nuclear localization sequence in said protein results in 

10 sequestration of the transcriptional inhibitor in the nucleus and limits its export to the 
cytoplasm. In the nucleus, the transcription inhibitor inhibits gene (expression. 
Preferably, gene expression imposes a growth disadvantage on said cells, so that 
cells comprising said transcription inhibitor-NLS fusion have a growth advantage. In 
certain related embodiments, any NLSs that are native to the transcriptional inhibitor 

15 can be deleted or mutated {i.e., rendered non-functional or of limited function) prior 
to fusion to putative NLS-containing proteins or protein-binding fragments. In such 
embodiments, basal-level expression of the gene operably linked to the promoter 
that is sensitive to or regulated by said transcriptional inhibitor can be greatly 
increased, thereby enhancing the signal of any non-native NLS introduced in the 

20 methods herein. Such ah embodiment is particularly useful when the product of the 
gene operably linked to the promoter that is sensitive to or regulated by said 
transcriptional inhibitor encodes a product that can convert a non-toxic compound to 
a cytotoxic or cytostatic compound, or simply encodes a toxic or cytostatic product 
itself. The presence of a NLS decreases the expression of a product that is 

25 cytotoxic or cytostatic, thus conferring a growth advantage on those cells that have 
had a functional NLS fused to the protein or protein-binding fragment. GalSOp, for 
example, is known to have a bipartite NLS signal, or two nonadjacent NLSs. (Nogi,Y. 
and Fukasawa, T., Molecular and Cellular Biology 1989 9:3009-3017). Deletion of 



one or both of the NLSs increases the mean residence tinne of GalSOp in the 
cytoplasm, thus increasing the basal transcription of a gene operably linked to the 
promoter that is sensitive to or regulated GalSOp, Thus, when a coding sequence, 
for example, from a cDNA library, that is fused to a GalSOp that has had its native 
5 NLSs mutated or deleted actually encodes an exogenous NLS, the GalSOp-based 
fusion will be targeted to the nucleus. A GalBOp-based fusion protein residing in the 
nucleus will interact with Gal4 and inhibit expression of a gene operably linked to the 
promoter that is sensitive to or regulated by GalSOp. When said gene encodes a 
product that can convert a nontoxic compound to a cytotoxic or cytostatic 

10 compound, or simply encodes a toxic or cytostatic product itself, suppression of its 
expression confers a growth advantage over cells with no functional, exogenous NLS 
sequence fused to the NLS-deleted GalSOp. Such a method provides a powerful 
selection mechanism for isolating NLSs. 

[095] The methods and reagents of the invention can be for galactose- 

15 independent inducible gene expression. In the practice of these methods of the 
invention, one or more recombinant expression constructs can be introduced into 
the cell from a multicellular organism encoding i) a fusion between a first interacting 
protein or protein-binding fragment and the amino- or carboxy-terminal of a 
transcriptional inhibitor (for example GalSOp); ii) a fusion between a second 

20 interacting protein or protein-binding fragment and a cytoplasm localization 
sequence as described above; and iii) a coding sequence operably linked to a 
promoter that is sensitive to or regulated by said transcriptional inhibitor. As 
galactose mediates the interaction between GalSOp and Gal3p, other molecules 
(either protein or non-protein based) can induce interaction between two proteins or 

25 protein-binding fragments, one of which can be fused to the amino- or carboxy- 
terminal of a transcriptional inhibitor and the other of which can be fused to a 
cytoplasm localization sequence as described hereinabove. Thus, any complex of 
three interacting proteins known in the art can be utilized, recombinantly, in a 



multicellular organisms or cells therefrom, to comprise reagents for the methods of 
this embodiment of the invention, one a fusion to the transcription inhibitor, the 
second as a fusion to a cytoplasm localization sequence, and the third (the protein in 
the protein complex binds the two others, i.e., the protein in the middle of the 

5 complex) can act as the inducer or transcription. 

[096] Thus, introduction of such a non-galactose inducing agent can facilitate 
the interaction between the first and second fusion proteins, thus preferentially 
sequestering or trapping the transcription inhibitor in the cytoplasm. The preferential 
sequestration or trapping of the transcription inhibitor in the cytoplasm reduces its 

10 ability to suppress expression of a coding sequence operably linked to a promoter 
that is sensitive to or regulated by said transcriptional inhibitor. Consequently, 
transcription of the coding sequence operabjy linked to a promoter that is sensitive 
to or regulated by said transcriptional inhibitor is stimulated. Preferentially, the 
coding sequence encodes a detectable or selectable product thus enabling easy 

15 determination of altered expression levels of the operably-linked coding sequence. 
In alternative embodiments, putative or potential inducing agents can be screened 
for activity by exposing the multicellular organisms or cells therefrom of this aspect 
of the invention to the putative inducing agent. Increased expression of the coding 
sequence operably-linked to a promoter that is sensitive to or regulated by a 

20 transcriptional inhibitor fused to the first interacting protein or protein-binding 

fragment, indicates positive inducing activity, or the ability to an facilitate interaction 
between the first interacting protein or protein-binding fragment-fusion and the 
second interacting protein or protein-binding fragment-fusion. Decreased expression 
of said coding sequence can indicate negative inducing activity, or anti-inducing 

25 activity. For example, the agent inhibits interactions between said first and second 
interacting proteins or protein-binding fragments. Unaltered expression of said 
coding sequence can indicate a lack of inducing activity. Thus, the invention 
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provides methods for galactose-independent inducible gene expression and 
methods of screening potential inducing agents, 

[097] The invention also provides methods for regulatable gene expression in 
multicellular organisms or cells therefrom comprising the steps of introducing a 

5 recombinant expression construct encoding GalSOp into the cell from a multicellular 
organism, introducing a recombinant expression construct encoding Gal3p into the 
cell from a multicellular organism, introducing a recombinant expression construct 
encoding Gal4p into the cell from a multicellular organism, introducing a coding 
sequence operably linked to a promoter sensitive to or regulated by 

10 Gal3p/Gal4p/Gal80p, and treating the cell from a multicellular organism with an 
inducing agent, wherein treatment mediates the cytoplasmic interaction of the Gal3p 
and GalSOp, relieving GajSOp inhibition and permitting Gal4p activation of the coding 
sequence operably linked to the promoter sensitive to or regulated by the GalSOp, 
GaISp and Gal4p, wherein the promoters are fused to a heterologous gene. In 

15 certain embodiments, the inducing agent is galactose. In other embodiments, Gal4p 
or a fragment thereof is expressed from a heterologous Gal4p gene introduced into 
the multicellular organisms or cells therefrom. In still other embodiments, the coding 
sequence is operably linked to a promoter sensitive to or regulated by GaISp, Gal4p 
and GalSOp are expressed from UAScAL^ontaining promoters. In the practice of 

20 certain aspects of this invention, a gene or coding sequence of interest, which can 
be any coding sequence (including heterologous sequences), is operably coupled to 
a promoter sensitive to or regulated by Gal4p and introduced into the cell from a 
multicellular organism, for example, on an expression vector or by integrating into 
the host genome. In such embodiments, a gene or coding sequence encoding 

25 GalSOp protein and GaISp are introduced into the cell from a multicellular organism, 
for example, on an expression vector or by integrating into the host genome. As 
discussed above, GalSOp, which is an inhibitor of the transcriptional activator Gal4p, 
binds with high affinity to a specific portion of the Gal4p transcriptional activation 



domain, and blocks Gal4p's ability to recruit RNA polymerase II at UASgal sites 
contained in GAL gene promoters. However, GalSOp's binding within the Gal4p 
transcriptional activation domain (and resulting suppression of transcription of the 
gene or coding sequence of interest) is relieved by the interaction between Gal3p 
5 and GalSOp, an Interaction that occurs only in the presence of galactose. In the 
presence of galactose, GalSOp, binds to Gal3p molecules that are normally confined 
to the cytoplasm in the cell from a multicellular organism, resulting in insufficient 
GalSOp in the nucleus to inhibit Gal4p and consequent expression of the gene or 
coding sequence of interest. Consequently, Gal4p is able to activate genes of the 

10 galactose regulon, as well as reporter genes expressed via UASgal promoters. 
Preferably, the gene or coding sequence of interest encodes a product that is 
detectable, thus enabling the detection of the regulatable gene expression in 
multicellular organisms or cells therefrom. 

[098] In a related embodiment, the invention provides methods for regulatable 

15 gene expression in multicellular organisms or cells therefrom comprising the steps 
of introducing a recombinant expression construct encoding GalSOp into the cell 
from a multicellular organism, introducing a recombinant expression construct 
encoding Gal3p into said cell from a multicellular organism, introducing a 
recombinant expression construct encoding Gal4p or a fragment thereof fused to a 

20 transcription factor or DNA-binding portion thereof into said cell of a multicellular 
organism, wherein the transcription factor or DNA binding fragment thereof is a 
transcription factor from a multicellular organism, introducing a coding sequence 
operably linked to a promoter sensitive to or regulated by the transcription factor, 
and treating the cell from a multicellular organism with an inducing agent, wherein 

25 treatment mediates the cytoplasmic interaction of the GaISp and GalSOp, relieving 
GalSOp inhibition of the Gal4p-transcription inhibitor fusion protein and permitting 
activation of the coding sequence operably linked to the promoter sensitive to or 
regulated by the transcription factor. In certain embodiments, the coding sequence 



operably linked to the promoter sensitive to or regulated by the transcription factor 
is an endogenous gene. In other embodiments, the promoter sensitive to or 
regulated by the transcription factor is an endogenous promoters, In still other 
embodiments, the promoters operably linked to a heterologous gene. In yet further 

5 embodiments , the coding sequence operably linked to the promoter sensitive to or 
regulated by the transcription factor is a reporter gene. Preferably, the gene or 
coding sequence of interest encodes a product that is detectable, thus enabling the 
detection of the regulatable gene expression in multicellular organisms or cells 
therefrom. In other embodiments, the inducing agent is galactose. In the practice 

10 of certain aspects of this invention, a gene or coding sequence of interest, which can 
be any coding sequence (including heterologous sequences), is operably coupled to 
a promoter sensitive to or regulated by the transcription factor and introduced into a 
cell from a multicellular organism, for example, on an expression vector or by 
integrating into the host genome. In such embodiments, a gene or coding sequence 

15 encoding GalSOp protein and Gal3p are introduced into the cell from a multicellular 
organism, for example, on an expression vector or by integrating into the host 
genome. As discussed above, GalSOp, which is an inhibitor of the transcriptional 
activator Gal4p, binds with high affinity to a specific portion of the Gal4p 
transcriptional activation domain, and blocks Gal4p's ability to recruit RNA 

20 polymerase II and other transcriptional machinery to the promoter. In this aspect of 
the invention, Gal4p, or a fragment thereof, does not interact directly with the 
promoter; rather, the transcription factor, or DNA-binding domain thereof, to which 
the Ga|4p or fragment thereof is fused binds to the promoter that is sensitive to or 
regulated by said transcription factor. Nevertheless, the Gal4p or fragment thereof 

25 (preferably a fragment that interacts with GalSOp) can, for example, recruit RNA 
polymerase II to the promoter that in the absence of GalSOp repression. Thus, in the 
presence of an inducing agent such as galactose, GalSOp binds to Gal3p molecules 
that are normally confined to the cytoplasm, resulting in insufficient GalSOp in the 



nucleus to inhibit the Gal4p-based repression of the transcription factor to which it is 
fused causing, in this example, expression of the gene or coding sequence of 
interest. Similarly, the transcription factor to which Gal4p or a fragment thereof is 
fused can be a transcription activator. Transcription factors useful in the practice of 
5 this aspect of the invention include but are not limited to NF6B, AP-1, AP-2, Sp-1, 
CRE, CREB, Myo-Dl, TBP, TFIIH, ER, STAT-1, and STAT-2. 

[099] The following examples illustrate certain aspects of the above-described 
invention and advantageous results thereof. The following examples are shown by 
way of illustration and not by way of limitation. The disclosure of each reference 
10 cited herein is explicitly incorporated by reference. Examples in yeast are provided 
as background for the present invention relating to multicellular organisms or cells 
therefrom. 

EXAMPLE 1 

15 

Preparation of Plasmids Containing IVIembrane Targeting Sequences and 
Localization of Fusion Protein Expression in Yeast 

[0100] To determine if GalSOp would interact with a membrane-bound binding 
20 partner, Gal3p (GalSOp's natural binding partner) was tethered to the cell and 
mitochondrial membranes. In addition, the effects of Gal3p sequestration on 
galactose induction of GAL gene expression were examined. 

[0101] Gal3p-GFP was targeted to the cytoplasmic or mitochondrial outer 
membranes by fusion to a protein myristoylation signal (Myr-Gal3p-GFP) or 
25 mitochondrial outer membrane signal anchor sequence {M0M-Gal3p-GFP), 

respectively. N-myristoylation (Johnson eta/., 1994, Annu. Rev. Biochem. 63: 869- 
914; Resh et a/,, 1999, Biochem. Biophys. Acta. 1451 : 1-16) was chosen because it 
is a co-translational process that occurs when the nascent peptide is still attached to 
the ribosome. Mitochondria targeting was chosen because it appears to be a very 
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fast and efficient process (Fujiki and Verner, 1993, J. Biol. Chem. 268: 1914-20; 
Funfscliilling et ai, 1999, Mol. Biol. Cell. 10: 3289-99). 

(0102] Gal3p derivatives (including Gal3p-GFP derivatives) bearing protein N- 
myristoylation signals were generated by inserting oligonucleotides encoding each 

5 signal peptide sequence into GALS gene at the N terminus. The N-myristoylation 
signals used herein are: MGCTVSTQTIGDESDP (SEQ ID N0:1), obtained from the N- 
terminal domain of Gpalp (a G protein a subunit), a N-myristoylation signal variant 
MACTVSTQTIGDESDP (SEQ ID N0:2) (the single G^A substitution abolishes the 
acylation and membrane targeting), or the mitochondria outer membrane signal 

iO anchor sequence (the first 29 amino acid residues of Tom70p 

(MKSFITRNKTAILATVAATGTAIGAYYYY (SEQ ID N0:3)), a component of the 
translocase of the outer mitochondrial membrane complex). It has been shown that 
the N-myristoylation signal used here is sufficient to target a heterologous protein to 
the plasma membrane (Gilen etai, 1992, J. Cell. Sci. Ul: 323544). The first 29 

15 amino acids of Tom70p protein is sufficient to target a heterologous protein to the 
outer membrane of mitochondria, leaving the bulk of protein extruding into the 
cytoplasm (McBride et a/., 1992, 1 Cell. Biol 119: 1451-1457). The ADH2 
promoter, FPRl, CNAl, H0M3 and CMDl sequences were PGR amplified using 
yeast Sc723 genomic DNA as the template and oligonucleotide primers disclosed in 

20 the Examples below. Green fluorescence protein (GFP) and cyan fluorescence 
protein (GFP) sequences were PGR amplified using pYGFPl (Cormack etai, 1996, 
Gene 173: 33-38) and pDH3 (Yeast Resource Center, University of Washington) as 
template, respectively. 

[0103] Plasmid constructions to generate the Myr-Gal3, mMyr-Gal3 and MOM- 

25 Gal3 plasmids were prepared as follows. Myr-Gal3 and mMyr-Gal3 fusions were 
constructed in plasmid pTEB16, a low-copy number yeast shuttling plasmid having a 
TRPl marker (Blank et al. 1997, Mol Cell Biol. 17: 2566-75). A Spel site and a Psfl 
site were created at the N terminus of GAL3 in pTEB16 (Peng and Hopper, 2000, 
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Molec. Cell Biol. 20: 5140-8). The oligonucleotides GANG49/50 encoding 
MGCTVSTQTIGDESDP (SEQ ID N0:6) were annealed and inserted at Spe\/Pst\ site to 
generate Myr-Gal3 construct. An identical procedure was used to create a mMyr- 
Gal3 bearing plasmid. The M0M-Gal3 plasmid (containing MOM targeting sequence, 

5 HA tag, GFP and GALS) was constructed in plasmid pTEB16, a low-copy number 
yeast shuttling plasmid having a TRPl marker (Blank etai, 1997, Mo/. Cell. Biol. 17: 
2566-75). A Spel site and a PstI site were created at the N terminus of GAL3 in 
pTEB16 (Peng and Hopper, 2000, Mol. Cell. Biol. 20: 5140-8). The oligonucleotides 
GANG53/55 and GANG54/56 were annealed separately and then inserted at 

10 Spel/PstI site to add the MOM targeting sequence. The oligonucleotides 

GANG59/60 were annealed and inserted at the PstI site to add the hemagluttinin 
(HA) tag. The PGR product using pYGFPl (Cormack etai, 1996, Gene 173: 33-8) 
as template and GANG31/61 as primers was cut with PstI and inserted at PstI site to 
add the GFP sequence. 

15 [0104] The oligonucleotides and the references relating to the constructions of 
the Myr-Gal3-GFP, mMyr-Gal3-GFP and M0M-Gal3-GFP plasmids are provided below. 



Table 1: Oligonucleotide Primers 



GANG3T(SEQ 
ID N0:7) 


Pstl-GFP forward 


AAC TGC AGG TAT GTC TAA AGG TGA 

AG 


GANG49 (SEQ 
ID N0:8) 


Myr/palm signal-1 


CTA GTA TGG GGT GTA GAG TGA GTA CGC 
AAA CAA TAG GAG AGG AAA GTG ATC CU 
CT(B CA 


GANG50 (SEQ 
ID N0:9) 


Myr/palm signal-2 


GAA GGA TGA CTT TOG TOT CCT AH GTT 
TGC GTA CTC ACT GTA CAC CCC ATA 


GANG53 (SEQ 
ID N0:10) 


Tom70 (29)-l- 
sense 


CTA GTA TGA AGA GGT TCA HA CAA 
GGA ACA AGA CAG CCA TTT TGG CAA 


GANG54 (SEQ 
ID N0:11) 


Tom70(29)-2- 
sense 


CCG no CTG CTA CAG GTA CTG CCA 
TCG GTG CCT ACT AH ATT ACG GTG 
CTG CA 


GANG55 (SEQ 
ID NO: 12) 


Tom70(29)-1- 
antisense 


GCA ACG GTT GCC AAA ATG GCT GTC 
TTG nc CTT GTA ATG AAG CTC TTC 
ATA 
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GANG56 (SEQ 
ID N0:13) 


Tom70(29)^2- 
antisense 


GCA CCG TAA TAA TAG TAG GCA COG 
ATG GCA GTA CCT GTA GCA 


GNAG59 (SEQ 
ID N0:14) 


Linker/HA for 
Tom70-3 - sense 


CAG TTG GGT GGT GGT GGT CGT TAC 
CCA TAC GAC GTC CCA GAC TAC GOT 
GCA 


GANG60 (SEQ 
ID N0:15) 


LinkerAIA for 
Tom70-3 - 
antisense 


GCG TAG TCT GGG ACG TCG TAT GGG 
TAA CGA CCA CCA CCA CCC AAC TGT 

GCA 


GANG61 (SEQ 
ID N0:16) 


Pstl-GFP w/o stop 


AAC TGC AGA TTT GTA CAA HC ATC 

CAT AC 


GANG113 
(SEQ ID NO: 
35) 


Cmdl-Sphl/Clal- 
forward 


AAA AAA GTA CAg cat gca aAT GTC 
CTC CAA TCT TAC CGA AG 


GANG114 
(SEQ ID NO: 
36) 


Cmdl-Sphl/Clal^ 
reverse 


GAT GCA CCT Aat cga ttT HA GAT AAC 
AAA GCA GCG AAT TG 



[0105] After addition of these respective targeting signals, Myr-Gal3p-GFP 
expressed using these constructs localized to the plasma membrane and MOM- 

5 Gal3p-GFP expressed using these constructs localized to the mitochondria. In 
contrast, wild type Gal3p-GFP was shown to be located throughout the cytoplasm, as 
was Gal3p-GFP joined to the variant N-myristoylation signal having a single amino 
acid substitution that cannot be acylated. These results are shown in Fig. lA, The 
expression patterns of Myr-Gal3p-GFP and M0M-Gal3p-GFP were examined by 

10 fluorescence microscopy using a Nikon Optiphot-2 epifluorescence microscope 

equipped with a lOOX objective. GFP was excited and its emission fluorescence was 
detected using Chroma filter # 41017 (Chroma Technology Corp., Brattleboro, VT). 
Images were acquired using a SenSys KF1400 CCD camera (Photometries Ltd, 
Tucson, AZ), controlled by QED (Pittsburgh, PA) software. 

15 [0106] Thus, fusion of Gal3p to membrane targeting sequences resulted in 
appropriate localization of Gal3p at the cell and mitochondrial membranes. 

EXAMPLE 2 
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Membrane Associated Gal3p Retains Ability to Bind GalSOp and Induce GAL 
Gene Expression in Yeast 

[0107] To verify appropriate activity of Myr-Gal3p or M0M-Gal3p in yeast cells 

5 harboring the plasmids encoding these fusion proteins, semi-quantitative colony 
growth assays and galactose-responsive reporter gene analyses were performed. 
Following verification of appropriate Gal3p expression, the distribution of GalSOp in 
Myr-Gal3p or M0M-Gal3p expressing cells was examined by fluorescence 
microscopy and Western blot analysis. 

10 [0108] Gene expression was determined for two different types of GAL gene 
promoters in cells carrying the membrane bound and cytoplasm sequestered Gal3p. 
First, a sensitive and semi-quantitative colony growth assay was used to assess 
expression of a HIS3 reporter gene whose promoter bears four UASqal sites. Cells 
of a gal3AgallA strain (Sc781) carrying Myr-Gal3p or M0M-Gal3p grew 

15 indistinguishably from cells harboring wild type Gal3p on synthetic medium lacking 
histidine in these assays. Because growth rates are proportional to the level of HIS3 
gene expression and HIS3 gene expression is proportional to the level of Gal3p 
activity, we concluded that Myr-Gal3p and M0M-Gal3p are as active as native Gal3p 
(data not shown). Second, expression of MELl, a galactose-responsive gene whose 

20 promoter bears a single UASqal site was quantified. See Lazo at al., 1977, Eur. J. 
Biochem. IT. 375-382; Buckholz and Adams, 1981, Molec. Gen. Genet. 182: 77- 
81; Post-Beittenmiller etai, 1984, Molec. Ceil. Biol. 4: 1238-1245. 

[0109] Despite effective sequestration of Gal3p to the cell cytoplasmic 
membranes, the MELl gene was expressed up to wild type levels upon galactose 

25 induction (these results are shown in Fig. IB). The level of MELl-encoded a- 
galactosidase in the extracts of cells carrying Myr-Gal3p was not significantly 
different from that detected in extracts of cells carrying wild type Gal3p (p>0.1. 
Student's t-test), or carrying a mMyr-Gal3p variant that abolished plasma association 
(p>0.1). The mitochondria associated Gal3p (M0M-Gal3p) gave rise to higher MELl 
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expression than the wild type Gal3p did (p<0.05). Using Western analyses, it was 
found that membrane association of Gal3p did not increase steady state levels of 
Gal3p. Cellular levels of Myr-GaISp and the mMyr-GaISp variant were slightly lower 
than that of wild type GaISp, while the M0M-Gal3p level was similar to that of wild 
5 type Gal3p (data not shown). Thus, Myr-Gal3p, mMyr-Gal3p and M0M-Gal3p are all 
synthesized to similar levels. 

[01 10] Several lines of evidence indicated that the full induction of GAL genes 
observed in cells harboring membrane-bound Gal3p was due to the interaction 
between Gal3p and GalSOp at the cytoplasmic membranes. First, the targeting 

10 methods employed were very effective in sequestering Gal3p to the membranes. 
Based on cell fractionation and Western analyses, essentially all the Gal3p in these 
cells was associated with the membrane when expressed joined to a myristoylation 
signal. For the mitochondria-targeted Gal3p, less than about 10% of M0M-Gal3p was 
found in the soluble cytoplasmic fraction (data not shown). Second, GalSOp 

15 redistributed to the plasma membrane and the intracellular vesicle membrane in cells 
expressing Myr-Gal3p (shown in Fig. 10. Lastly, a small fraction of Gal3p that might 
have escaped the cytoplasmic sequestering methods used herein would not have 
been sufficient to give rise to the levels of GAL gene expression observed in these 
experiments (shown in Fig. 2). Three plasmids providing different subcellular 

20 locations and expression levels of Gal3p were constructed and expression levels of 
the MELl gene in cells carrying these plasmids were determined. The results 
showed that a five-fold reduction of GaISp resulted in a two-fold reduction in MELl 
gene expression level, indicating that Gal3p is not in large excess in the cells and 
that quantitative interactions between GaISp and GalSOp are required for the 

25 transcription of GAL genes. In comparison, a two-fold reduction of Gal3p derivative 
was observed in cells carrying Myr-GaISp and resulted in 20% reduction in MELl 
gene expression level. 



56 



[01 11] In summary, these results demonstrated that Gal3p maintains its induction 
function and its induction capacity and that active transcription of GAL genes in cells 
harboring Myr-Gal3p is due to the Gal3p-Gal80p interaction at the plasma and 
vesicular membranes outside the nucleus, rather than to GaISp interaction with the 
5 Gal80p-Gal4p complex in the nucleus. Additionally, the interaction between GalSOp 
and membrane bound GaISp resulted in the appropriate activation of GAL reporter 
gene expression within the nucleus. 

EXAMPLE 3 

10 

GalSOp Association with Gal4p within Gene Promoters is Reduced in Yeast 

[0112] Because the association of GalSOp with Gal4p at a GAL gene promoter is 
essential for inhibition of Gal4p in the absence of galactose, the effect of galactose 

15 (which triggers the interaction between GaISp and GalSOp) on the amount of GalSOp 
complexed with DNA-^bound Gal4p was determined (Torchia et al., 1984, Mol. Cell 
Biol. 4: 1521-1527; Lohr et al., 1987, J. Biol. Chem. 262: 15589-15597). To 
determine the extent of Gal4p and GalSOp association with the UASGAL region of the 
GALI/GALIO gene promoter, formaldehyde-based in vivo cross-linking assays were 

20 performed followed by chromatin immunoprecipitation. 

[01 13] Chromatin immunoprecipitations were performed as described by 
Braunstein eta/., 1993, Genes Dev. 7: 592-604 and Kuras etal. 1999, Nature 399: 
609-613. Briefly, wild type Sc723 cells (Blank et al., 1997, Mol. Cell. Biol. 17: 
2566-2575) were grown to early exponential growth phase (as measured by 

25 absorbance at 600nm -0.4) in 250mL yeast extract-peptone medium containing 2% 
glycerol, 3% lactic acid, and 0.05% glucose (-galactose, un-induced). For galactose- 
induced cultures, galactose was added to the un-induced cultures to a final 
concentration of 2% and the cultures were incubated for an additional 20 minutes. 
Formaldehyde was added to a final concentration of 1%, the cultures were incubated 

30 for 20 minutes at room temperature, and glycine was added to a final concentration 

57 



of 300mM. All the following steps were carried out at 4°C. Cells were washed twice 
with TBS (20mM Tris-HCI, 150mM NaCI, pH 8.0), resuspended in ImL lysis buffer 
(50mM Tris-HCI pH 8.0, lOmM EDTA, 1% SDS, ImM PMSF), and disrupted by 
vortexing with glass beads. The cross-linked chromatin was pelleted by 

5 centrifugation at 200,000 gfor 12 minutes, resuspended in ImL lysis buffer, and 
sonicated to yield DNA fragments of 300 base pairs average size. Soluble chromatin 
was separated from insoluble materials by centrifugation at 13,000 g for 15 
minutes. Finally, the volume of soluble chromatin was adjusted to lOmL with dilution 
buffer (16.7mM Tris-HCI, pH 8.0, 16.7mM NaCI, 1.2mM EDTA, 0.01% SDS, 1.1% 

10 Triton-XlOO, ImM PMSF) and the resultant chromatin solution was stored in ImL 
aliquots at -80° C. For immunoprecipitation assays, 5^L rabbit anti-Gal4p serum, 5 
laL anti-Gal80p serum, or 5 jxL anti-Gal3p serum was added to ImL chromatin 
solution. After overnight incubation on a rotator, 75 |iL of protein-A-Sepharose 
beads were added, and the reactions were incubated for two more hours. 

15 Precipitated protein-DNA complexes were eluted from the beads, cros$-linked 
reversed, treated with proteinase K, and analyzed by quantitative radioactive PCR 
using primer pairs GANG67/68 for the UASgal region and GANG71/72 for an 
intergenic control region located 5 kb downstream from the GALl gene. The 
sequences of the primers were GANG67 : CATCGCAHACCACCATATACATATCC (SEQ 

20 ID N0:17); GANG68: GAAGGTTTGTGGGGCCAGGTTACTGC (SEQ ID N0:18); GANG71: 
GTGCATTTGGCCTTCAATGAGC (SEQ ID NO: 19); GANG72: 
AAGTGATGTTCGACATACCTGTAAC (SEQ ID N0:20). A dilution (1/30,000) of input 
DNA, 1/200 of anti-Gal4p antibody precipitated DNA, 1/50 of anti^GalSOp antibody 
precipitated DNA, and 1/25 of anti-Gal3p antibody precipitated DNA were used as 

25 template for the various reactions. PCR conditions were as follows: 0.5|aM each 
primer, 0.25mM each dNTP, 1.5mM MgCl2, 0.06 mCi/mL (a-^^P)dATP in a 20^L 
reaction volume. PCR cycle regimen was as follows: 3 minutes at 94°C, followed by 
25 cycles of 1 minute at 94°C, 1 minute at 52°C, and 1 minute at 72''C. PCR 
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products were resolved on 8% polyacrylamide gels in IX TBE buffer. Quantitation of 
incorporated {a-^^P}dATP in the PGR products was performed using a 
Phosphorlmager (IVIolecular Dynamics) analysis. 

[0114] Because GalSOp is associated with the UASg/u. solely through its binding 
5 to Gal4p, comparison of the amount of the UASgal DNA associated with Gal4p and 
GalSOp provided an assessment of the interaction between Gal4p and GalSOp. It 
was found that, while Gal4p occupancy of the UASgal site slightly increased upon 
galactose signaling, the GalSOp association with the UASgal region decreased 
significantly. Repeated trials did not yield any evidence that Gal3p is associated with 

10 the GALl/IO gene promoter (results shown in Fig. 3). These data indicated that 
concurrent with galactose-induced interaction between GaISp and GalSOp in the 
cytoplasm, the association between GalSOp and UASc/iL-bound Gal4p decreases in 
the nucleus. These results were contrary to a long-held view (Leuther & Johnston, 
1992, "Non-dissociation of Gal4 and GalSO in vivo after galactose induction," 

15 Science 256 : 1333-1334) that GalSO does not dissociate from Gal4 in response to 
galactose. 

EXAMPLE 4 

20 GAL Gene Expression Activated by Surrogate Protein-Protein Interactions in 
the Cytoplasm: The 80-Trap 

[01 15] The results shown in Examples 1-3 above indicated that the galactose- 
triggered Gal3p-Gal80p interaction traps GalSOp in the cytoplasm, thereby causing 
25 the observed removal of GalSOp from the promoters of GAL genes and activation of 
Gal4p. The behavior of the Gal80p-Gal3p interaction in vivo is consistent with this 
view, as the membrane-bound Myr-Gal3p is shown to sequester GalSOp to the 
membranes where Myr-Gal3p is located (as shown in Fig. IC). 

[0116] Given the new insights into the activation of Gal4p shown here, a method 
30 for trapping GalSOp in the cytoplasm was developed for use as an assay to detect 
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protein-protein interactions in the cytoplasm and/or nuclear export/localization 
sequences and to regulate Gal4p-activation of GAL gene promoters by small 
molecules other than galactose. In this method, termed the 80-Trap method, a 
specific bait protein (identified as a known protein of interest) is fused to GalSOp to 
5 create a fusion protein with Gal80 at either the amino- or carboxyl-terminus of 
GalSOp. The bait protein's binding partner, prey protein (either another known 
protein of interest or a multiplicity of yet unidentified potential partners encoded in a 
cDNA library), is tethered to the cell or mitochondrial outer membrane via fusion to 
membrane targeting sequences (myristoylation or mitochondrial Tom70 sequence, 

10 respectively). If the bait and prey proteins interact in the cytoplasm, Gal80 is 
trapped in the cytoplasm and prevented from inhibiting Gal4p-mediated 
transcriptional activation of GAL gene promoters in the nucleus. 

[0117] Using this method, it was expected that substituting Gal3p with a 
surrogate-binding factor for GalSOp in the cytoplasm would activate GAL gene 

15 expression. To test this possibility, the interaction between Fprlp and Canlp that is 
, elicited by a small membrane-permeable molecule FK506, a macrolide isolated from 
Streptomyces tsukubaensis (Kino etai, 1987 , J Antibiot (Tokyo) AO: 1249-1255; Liu 
etai., 1991, Cel/ 66: 807-815; Cardenas etai, 1995, £mbo J 14: 2772-2783; and 
Griffith et a/., 1995, CeW 82: 507-522) was used. Expression constructs and 

20 production of yeast strain ScGP786 was as is described below. 

[01 18] pGP82 (GalSO-Fprl) is made up of nucleotide sequences encoding a 
GAL80, 2 HA tags, and FPRl on a pRS416 (Sikorski and Hieter, 1989, Genetics 
122 : 19-27) backbone. Gfil.80 sequence was obtained from yeast genomic clones 
using polymerase chain reaction, and having the sequence disclosed at 

2-5 • http://genome-www4. stanford.edu/egi-bin/SGD/locus. Dl?locus=gal80. HA was 

derived from oligonucleotides GANG33/34. FPRl was the Clal digested PGR product 
generated by using yeast genomic DNA as template and GANG117/11S as primers. 
pGP130 (Myr-Hom3) contains an ADH2 promoter, CFP sequence, and H0M3 



sequence on a pRS424 (Sikorski and Hieter, 1989, Genetics 1£2: 19-27) backbone. 
The /\DH2 promoter was the PGR product using yeast genomic DNA as template and 
GANG83/84 as primers. CFP was the PGR product using pDH3 as template (Yeast 
Resource Center, University of Washington) and GANGlOl/112 as primers. H0M3 

5 was a PGR product using yeast genomic DNA as template and GANG125/126 as 
amplification primers. pGP139 (Myr-Cnal) contains an ADH2 promoter, GFP 
sequence, and GNAl sequence. An ADH2 promoter was obtained as a PGR product 
using yeast genomic DNA as template and GANG83/84 as primers. GFP was 
obtained as a PGR product using pDH3 as template (Yeast Resource Genter, 

10 University of Washington) and GANGlOl/1 12 as primers. GNAl was obtained from 
a PGR product using yeast genomic DNA as the template and GANG 141/1 42 as 
primers. 

[0119]ScGP786 was derived from Sc786 (Blank etai, 1997, Mo/. Cell. Biol. 17: 
2566-2575). For the replacement of FPRl gene with ADEl, ADEl ORF under control 

15 Of a GALl/10 gene promoter was cloned into plasmid pUG6 carrying the dominant 
Kan" gene (Guldener et a/., 1996, Nucleic Acids Research 24: 2519-2524). Yeast 
cells were transformed by the PGR product from the reaction using oligonucleotide 
primers GANGl 19/120 with 45bp homology to FPRl chromosomal locus at 33 nt 
upstream and 33 nt downstream. Transformation and homologous recombination 

20 events were selected by plating cells on YPD plates containing 200mg/L Geneticin 
(G418 sulphate). 

[0120] FK506 was used to trigger the interaction between a Gal80-Fprlp fusion 
and plasma membrane-localized Myr-Ganlp. It has been shown that FK506 by itself 
does not activate transcription of the GAL genes (Biggar and Grabtree 2001, Embo. 
25 J. 20:3167-3176). Induction of the galactose-inducible HISS reporter gene was 
assayed. Cells of a galSOA galSA gall A strain (ScGP786) carrying both Gal80-Fprl 
and Myr-Canip plasmids exhibited evident growth only on the plate containing FK506 
(shown in Fig. 4A). It has been reported that Fprlp also interacts with Hom3p, the 
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aspartokinase, in the absence of FK506 in a yeast two-hybrid assay (Alarcon and 
Heitman, 1997, Mol. Cell Bio., 17: 5968-5975). Following a scheme that paralleled 
the one above, it was found that the heterologous interaction between Gal80-Fprlp 
and Myr-Honn3p was able to activate HIS3 reporter gene expression (shown in Fig, 

5 4B). It Was also determined by the Western analyses that the GalBO-Fprlp fusion 
was present at a slightly higher level in cells containing both GalSO-Fprlp and Myr- 
Hom3p as compared to cells harboring the non-interacting controls, GalSO-Fprlp 
and Myr-Cmdlp, indicating that activation of the HIS3 reporter gene was not due to 
the degradation of GalSO-Fprlp in cells expressing Myr-Hom3p. 

10 [01 21] The above results demonstrated that Gal4p-mediated gene expression 
may be activated by the cytoplasmic interaction of surrogate proteins. Thus, the 
potential interaction between proteins of interest and uncharacterized cDNAs may be 
detected with this novel cytoplasmic based system. In addition to the 'forward' two- 
hybrid modes described above, the system may also be used effectively in the 

15 'reverse' two-hybrid mode (analogous to the use of the conventional two hybrid 
system as shown in Vidal etal., 1996, Proc. Natl. Acad. Sci. USA 93:10315-20). 
The 80-Trap system may select for loss of interaction mutations in either cDNA (bait 
or prey) and provides a method for identifying and mapping the amino acids involved 
iri protein-protein interaction. 
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EXAMPLE 5 

Detecting Transcriptional Proteins Interactions in Yeast With The 80-Trap 
Method 



[0122] In conventional yeast two-hybrid methods, transcriptional activator 
proteins can activate two-hybrid reporters through either direct binding to the RNA 
polymerase or through binding to other proteins that in turn bind to an RNA 
polymerase subunit. The protein interactions for the 80-Trap occur in the cytoplasm 
30 and would thus circumvent the false activation of GAL promoters. To examine the 
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actions of transcriptional proteins in the 80-Trap system, the transcriptional protein, 
TATA binding protein (TBP), was fused to GalSOp. The N-terminal domain (amino 
acids 1-71) of TAF145 was fused to the myristoylation sequence. TAF145 and TBP 
are established components of the transcriptional machinery and are known to 
5 interact via proteiniDrotein interactions (Kokubo, et a/., 1998, Mol. Cell. Biol. 189 : 
1003-1012). Heterologous interaction between the GalSO-TBP and Myr-TAF145 was 
able to activate the HIS3 reporter gene expression (shown in Fig. 5, line 1). 

[0123] In a subsequent study, the transcription activation domain (AD) of Gal4p 
was tethered to the cell membrane by fusion to the myristoylation signal. Cells 

10 harboring the Myr-Gal4 (AD) fusion deplete GALBOp from the nucleus and in turn 
permitted the native Gal4p to activate transcription of GAL reporter genes (as shown 
in Figure 6). Similarly, tethering of any one of a multiplicity of other transcription 
activation domains outside of the nucleus and its interaction with a binding partner 
fused to GalSOp would result in reporter gene activation. 

15 [0124]These results demonstrated that transcription factors, and transcription 
activator domains may be tethered to the cells plasma and vesicular membranes or 
mitochondrial outer membrane and used to accurately screen for potential protein- 
protein interactions. This method permits the study of protein interactions not 
suitable for conventional two-hybrid approach. Because protein interactions occur in 

20 the cytoplasm, the potential for false activation of reporter genes is reduced, and the 
potential for detecting protein interactions when one or both of the proteins require a 
cytoplasmic location is optimized. 



Table 2: Oligonucleotide 


^rimers 


GANG33 (SEQ ID NO: 
21) 


CGT TAG CCA TAG GAG GTC CCA GAC TAG GCT 
GGTTGG 


GANG34 (SEQ ID NO: 

22) 


CGC CAA CCA GCG TAG TCT GGG ACG TCG TAT 

GGG TAA 


GANG83 (SEQ ID NO: 
23) 


GAT ACT TCC CAA TTC GTC TTC AGA G 
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GANG84 (SEQ ID NO: 
24) 


CTG GAA TAG ACT AGT TGT GTA HA CGA TAT 
AG 


GANGlOl (SEQ ID NO: 

25) 


CCA ATG CAT GTA TGA GTA AAG GAG AAG AAC 
III TCA C 


GANG113(SEQ ID NO. 
35) 


AAA AAA GTA CAg cat gca aAT GTC CTC CAA TCT 
TAC CGA AG 


GANG114(SEQ ID NO: 
36) 


GAT GCA OCT Aat cga ttT Uk GAT AAC AAA GCA 
GCG AAT TG 


GANG112(SEQID NO: 
26) 


TTT GTA Hg CAT gcg gat cGG GGA TC 


GANG117(SEQ ID NO: 
27) 


ACA AGT AAT Aat cga teg TCT GAA GTA AH GAA 
GGT AAC 


GANG118(SEQIDN0: 
28) 


TTT GCT TTT Aat cga ttG UG ACC JIC AAC AAT 
TC 


GANG119 (SEQ ID NO: 
29) 


ATA AAC TCG TGA AAG CTJ AAA GTA AGG CCT 
TTC ACC TAA ACT CGA GTC GU AGA ACG CGG 

CTA C 


GANG120 (SEQ ID NO: 
30) 


TCA ATT AAG GCT CAG ATA CTT ACC ATA AAC 
ATA AAT AAA AAG CAG TCA CTA TAG GGA GAC 
CGG CAG 


GANG125 (SEQ ID NO: 
31) 


TTT AAC TTT TAC gca tgc aaA TGC CAA TGG AH 

TCC AAC CTA C 


GANG126 (SEQ ID NO: 
32) 


GGT GGA m Agg cgc ctA ATT CCA AGT CTT 

TTC AAT TGT TC 


GANG141 (SEQ ID NO: 
33) 


CAA CGC CAg CAT Get gTC GAA AGA CH GAA 
TTC TTC ACG C 


GANG142 (SEQ ID NO: 
34) 


GTG CTT AGa teg att aCG TTT CAT TCA AAC CH 
CAG TCC C 



EXAMPLE 6 

5 Detection of Nuclear Export Sequences and Nuclear Localization Sequences 
in Yeast by Fusion to GalSOp 

[0125] In addition to measuring protein-protein interactions, the methods of the 
present invention are useful for detecting peptides containing nuclear export or 
10 nuclear localization sequences. For detecting nuclear export sequences, GalSOp is 
fused to a cDNA or one or a plurality of members of a cDNA library. cDNAs 
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encoding peptides containing a nuclear export sequence (NES) sequester GalSOp to 
the cytoplasnri. This in turn permits Gal4p-mediated transcriptional activation of GAL 
gene promoters in the presence of 0.01% galactose, a level normally not sufficient 
to permit Gal4p-mediated transcriptional activation of GAL gene pronioters. (shown 
5 in Figure 7). 

[0126] For detecting nuclear localization sequences, cDNAs encoding proteins 
containing an NLS sequence are fused to GalSOp. The presence of an exogenous 
nuclear localization sequence in said protein results in sequestration of GalSOp in the 
nucleus and prohibits export of GalSOp to the cytoplasm, resulting in the inability to 

10 bind to Gal3p in the cytoplasm when galactose is present. This inability of GalSOp to 
exit from the nucleus and bind to Gal3p allows GalSOp to maintain its high 
concentration in the nucleus and binding to Gal4p and prevents Gal4p-mediated 
activation of genes whose expression is regulated by galactose and Gal4p. In yeast 
cells harboring a Ura3 gene expressed from a GAL gene promoter, said fusion of an 

15 NLS sequence to GalSOp prevents the URA3 enzyme from being produced and 
consequently protects the cell from the toxic effects of 5-FOA. The yeast cells 
harboring said NLS-GalSOp fusion thus grow as colonies on nutrient agar containing 
3% glycerol/2% lactic acid, 0.1% galactose as carbon source and 5-FOA . In 
summary, the methods of the present invention provide a means for detecting both 

20 nuclear export and nuclear localization sequences. 

Example 7 

Construction of Plasmid Expressing GFP Fusion Proteins for Experiments In 
25 Cells from Multicellular Organisms 

[0127] A full-length 1559-bp yeast GAL3 ORF from pTEB16 (Blank et. al., Mol Cell 
Biol. 1997) was PCR-amplified with Pfu DNA polymerase (Invitrogen) and primers 
VPOOOl and VP0005 that created Sac II and BamHI sites upstream and downstream 
30 of the GALS ORF accordingly. The GAL80 ORF sequence was amplified from 
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PGP15A (Peng, G. and Hopper, J.E., Mol Cell Biol, 2000) using prinfiers VP0003, 
VP0006 that created Sac II, BamHI restriction sites flanking the GAL80 ORF 
sequence. The recipient plasmid was pEGFP-NI from (CLONTECH Laboratories, Inc) 
which contains the enhanced GFP ORF sequence expressed from the CMV immediate 

5 early promoter and transcriptionally terminated by the SV 40 early mRNA 

polyadenylation signal. Amplified GAL3 and GAL80 sequence were inserted to the 
pEGFP-NI plasmid that had been digested with Sacll and BamHI. The resulting 
plasmids pVRP2 and pVRP4 had GFP fused to the COOH terminus of GAL3 and 
GAL80, respectively. 

10 [0128] Primers: 

[0129] VPOOOl - 5' AATACCGCGGATGAATACAAACGnCCA 3' (SEQ \D 
N0:4), 

[01 30] VP0005 - 5" AATAGGATCCGCnGTTCGTACAAACAAGT 3' (SEQ ID 

N0:5), 

15 [0131] VP0003-5' AATACCGCGGATGGACTACAACAAGAGA 3' (SEQ ID 
N0:37), 

[01 32] VP0006 - 5 ' AATAGGAtCCGCTAAACTATAATGCGAGAT 3' (SEQ ID 
N0:38). 

20 Cell Culture, Transfection and Microscopy for Experiments in Cells from 
Multicellular Organisms 

[0133] HEK 293 and 3T3 cells were cultured at 37°C in 5% CO2 atmosphere in 
Dulbecco's modified Eagle's medium (DMEM) supplemented with 10% (v/v) fetal calf 
25 serum (Sigma) containing penicillin and streptomycin. As 3T3 cells required the 
additional supplement of glutamine, Transfections were carried out using the 
lipofectAMINE™ 2000 REAGENT (Invitrogen Life technologies). Plasmid DNA (8Dg) 
and 15 Dl of lipofectamine were added to cells on 35 mm culture dishes according 
to the manufacturer's instructions. Typically, HEK 293 and 3T3 cells at 60±70% 
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confluency in medium containing 2 ml of complete DIVIEI\^ were transfected as 
above. This routinely resulted in 20-40% transfection efficiency, as determined by 
visualization of expressed GFP by fluorescence microscopy of live cells 24 h after 
transfection. For live cell microscopy the cells were grown on glass cover slips in 35 
5 mm plates. 

Fluorescence Imaging Gal3-GFP and Gal80-GFP Fusion Proteins In 
Multicellular Organisms or Cells Therefrom 

10 [0134] For imaging GFP-fusion proteins in fixed cells the protocol of Dundr, M., et 
al., (2000 J Cell Biol 150 p433446) was used. After transfection of cells with 
plasmid expressing the GFP fusion protein of interest, the cells were seeded on 
sterile cover slips and grown typically 16-24 hours. The cells were fixed cells in 2% 
formaldehyde in PBS, pH 7.4 for 15 min. at room temperature directly on the cover 

15 slip, quickly rinsed in 4 ml PBS, pH 7.4 and washed in 2 ml PBS, pH 7.4, 2X5 min. 
at room temperature. The cover slip containing the fixed cells was positioned face 
down onto a 10 Dl drop of mounting media on a glass slide. Imaging was performed 
with an Olympus Confocal Scanning Microscope using a lOOX objective. The 
evidence shows that 1) the GalSOp protein exists in both the nucleus and the 

20 cytoplasm (Figure 8), and ii) the Gal3 protein is located only in the cytoplasm, i.e., 
excluded from the nucleus (Figure 9). 

[0135] It should be understood that the foregoing disclosure emphasizes certain 
specific embodiments of the invention and that all modifications or alternatives 
equivalent thereto are within the spirit and scope of the invention as set forth in the 

25 appended claims. 



